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To Professor Mark Kac 


Preface to the second edition 


The most important revisions in this edition are: (1) enlargement of the 
treatment of p-adic functions in Chapter IV to include the Iwasawa logarithm 
and the p-adic gamma-function, (2) rearrangement and addition of some 
exercises, (3) inclusion of an extensive appendix of answers and hints to the 
exercises, the absence of which from the first edition was apparently a source 
of considerable frustration for many readers, and (4) numerous corrections 
and clarifications, most of which were proposed by readers who took the 
trouble to write me. Some clarifications in Chapters IV and V were also 
suggested by V. V. Shokurov, the translator of the Russian edition. I am 
grateful to all of these readers for their assistance. I would especially like to 
thank Richard Bauer and Keith Conrad, who provided me with systematic 
lists of misprints and unclarities. 


I would also like to express my gratitude to the staff of Springer-Verlag 
for both the high quality of their production and the cooperative spirit with 


which they have worked with me on this book and on other projects over the 
past several years. 


Seattle, Washington N.LK. 
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Preface to the first edition 


These lecture notes are intended as an introduction to p-adic analysis on the 
elementary level. For this reason they presuppose as little background as possi- 
ble. Besides about three semesters of calculus, I presume some slight exposure to 
more abstract mathematics, to the extent that the student won’t have an adverse 
reaction to matrices with entries in a field other than the real numbers, field 
extensions of the rational numbers, or the notion of a continuous map of topolog- 
ical spaces. 

The purpose of this book is twofold: to develop some basic ideas of p-adic 
analysis, and to present two striking applications which, it is hoped, can be as 
effective pedagogically as they were historically in stimulating interest in the 
field. The first of these applications is presented in Chapter II, since it only 
requires the most elementary properties of Q@,; this is Mazur’s construction by 
means of p-adic integration of the Kubota— Leopoldt p-adic zeta-function, which 
*‘p-adically interpolates’’ the values of the Riemann zeta-function at the negative 
odd integers. My treatment is based on Mazur’s Bourbaki notes (unpublished). 
The book then returns to the foundations of the subject, proving extension of the 
p-adic absolute value to algebraic extensions of Q,, constructing the p-adic 
analogue of the complex numbers, and developing the theory of p-adic power 
series. The treatment highlights analogies and contrasts with the familiar con- 
cepts and examples from calculus. The second main application, in Chapter V, is 
Dwork’s proof of the rationality of the zeta-function of a system of equations 
over a finite field, one of the parts of the celebrated Weil Conjectures. Here the 
presentation follows Serre’s exposition in Séminaire Bourbaki. 

These notes have no pretension to being a thorough introduction to p-adic 
analysis. Such topics as the Hasse— Minkowski Theorem (which is in Chapter 1 
of Borevich and Shafarevich’s Number Theory) and Tate’s thesis (which is also 
available in textbook form, see Lang’s Algebraic Number Theory) are omitted. 
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Preface 


Moreover, there is no attempt to present results in their most general form. For 
example, p-adic L-functions corresponding to Dirichlet characters are only dis- 
cussed parenthetically in Chapter II. The aim is to present a selection of material 
that can be digested by undergraduates or beginning graduate students in a 
one-term course. 

The exercises are for the most part not hard, and are important in order to 
convert a passive understanding to a real grasp of the material. The abundance of 
exercises will enable many students to study the subject on their own, with 
minimal guidance, testing themselves and solidifying their understanding by 
working the problems. 

p-adic analysis can be of interest to students for several reasons. First of all, in 
many areas of mathematical research—such as number theory and representation 
theory—p-adic techniques occupy an important place. More naively, for a stu- 
dent who has just learned calculus, the *‘brave new world’’ of non- Archimedean 
analysis provides an amusing perspective on the world of classical analysis. 
p-adic analysis, with a foot in classical analysis and a foot in algebra and number 
theory, provides a valuable point of view for a student interested in any of those 
areas. 


I would like to thank Professors Mark Kac and Yu. I. Manin for their help 
and encouragement over the years, and for providing, through their teaching and 
writing, models of pedagogical insight which their students can try to emulate. 
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CHAPTER I 


p-adic numbers 


1. Basic concepts 


If X is a nonempty set, a distance, or metric, on X is a function d from pairs 
of elements (x, y) of X to the nonnegative real numbers such that 


(1) d(x, y) = 0 if and only if x = y. 
(2) d(x, y) = d(y, x). 
(3) d(x, y) < d(x, z) + d(z, y) for all ze X. 


A set X together with a metric d is called a metric space. The same set X can 
give rise to many different metric spaces (X, d), as we'll soon see. 

The sets X we'll be dealing with will mostly be fields. Recall that a field F 
is a set together with two operations + and - such that F is a commutative 
group under +, F — {0} is a commutative group under -, and the distributive 
law holds. The examples of a field to have in mind at this point are the field 
Q of rational numbers and the field R of real numbers. 

The metrics d we’ll be dealing with will come from norms on the field F, 
which means a map denoted || || from F to the nonnegative real numbers 
such that 


(1) ||x|| = 0 if and only if x = 0. 
(2) |lx-yl] = Ix] - [yl 
(3) x+y] < lx +l. 


When we say that a metric d “‘comes from” (or “‘is induced by”) a norm 
|| ||, we mean that d is defined by: d(x, y) = ||x — yl]. It is an easy exercise 
to check that such a d satisfies the definition of a metric whenever || || is a 
norm. 

A basic example of a norm on the rational number field @ is the absolute 
value |x|. The induced metric d(x, y) =|x — y| is the usual concept of 
distance on the number line. 
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My reason for starting with the abstract definition of distance is that the 
point of departure for our whole subject of study will be a new type of 
distance, which will satisfy Properties (1)}(3) in the definition of a metric 
but will differ fundamentally from the familiar intuitive notions. My reason 
for recalling the abstract definition of a field is that we'll soon need to be 
working not only with @ but with various “‘extension fields” which contain Q. 


2. Metrics on the rational numbers 


We know one metric on Q, that induced by the ordinary absolute value. Are 
there any others? The following is basic to everything that follows. 


Definition. Let p  {2, 3, 5,7, 11, 13, ...} be any prime number. For any 
nonzero integer a, let the p-adic ordinal of a, denoted ord, a, be the highest 
power of p which divides a, i.e., the greatest m such that a = 0 (mod p”™). 
(The notation a = b (mod c) means: c divides a — b.) For example, 


ord; 35 = 1, ord, 250 = 3, ord, 96 = 5, ord, 97 = 0. 


(If a = 0, we agree to write ord, 0 = 00.) Note that ord, behaves a little 
like a logarithm would: ord,(a,a,) = ord, a, + ord, ag. 

Now for any rational number x = a/b, define ord, x to be ord, a — 
ord, b. Note that this expression depends only on x, and not on a and 5, 
i.e., if we write x = ac/bc, we get the same value for ord, x = ord, ac — 
ord, be. 


Further define a map | |, on Q as follows: 


ifx £0; 


pss”? 
0, ifx =0. 

Proposition. | |, is a norm on Q. 

Proor. Properties (1) and (2) are easy to check as an exercise. We now verify 


(3). 

If x = Oor y = 0, or if x + y = 0, Property (3) is trivial, so assume x, y, 
and x + y are all nonzero. Let x = a/b and y = c/d be written in lowest 
terms. Then we have: x + y = (ad+ be)/bd, and ord,(x + y) = 
ord,(ad + bc) — ord, b — ord, d. Now the highest power of p dividing the 
sum of two numbers is at Jeast the minimum of the highest power dividing 
the first and the highest power dividing the second. Hence 


ord,(x + y) = min(ord, ad, ord, bc) — ord, b — ord, d 
= min(ord, a + ord, d, ord, b + ord, c) — ord, b — ord, d 
= min(ord, a — ord, b, ord, ¢ — ord, d) 
= min(ord, x, ord, y). 
Therefore, |x + y|, = p~%*+™ < max(p~%4>*, p~ 4s") = max(|x|p, |Ylp)> 
and this is < |x|, + |ylp. 


xl» > 
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We actually proved a stronger inequality than Property (3), and it is this 
stronger inequality which leads to the basic definition of p-adic analysis. 


Definition. A norm is called non-Archimedean if ||x + y|| < max(|x]], |lyl) 
always holds. A metric is called non-Archimedean if d(x,y) < 
max(d(x, z), d(z, y)); in particular, a metric is non-Archimedean if it is 
induced by a non-Archimedean norm, since in that case d(x, y) = 
|x — yl] = |@ — z) + @ — y)|| < max(||x — z], | z — yll) = max(d@, z), 
d(z, y)). 


Thus, | |, is a non-Archimedean norm on Q. 

A norm (or metric) which is not non-Archimedean is called Archimedean. 
The ordinary absolute value is an Archimedean norm on Q. 

In any metric space X we have the notion of a Cauchy sequence 
{a,, a), a3, ...} of elements of X. This means that for any ¢ > 0 there exists 
an N such that d(a,,, a,) < ¢ whenever both m > Nandn> N. 

We say two metrics d, and d, on a set X are equivalent if a sequence is 
Cauchy with respect to d, if and only if it is Cauchy with respect to d.. We 
say two norms are equivalent if they induce equivalent metrics. 

In the definition of | |,, instead of (1/p)**»* we could have written p4»* 
with any p (0, 1) in place of 1/p. We would have obtained an equivalent 
non-Archimedean norm (see Exercises 5 and 6). The reason why p = 1/p is 
usually the most convenient choice is related to the formula in Exercise 18 
below. 

We also have a family of Archimedean norms which are equivalent to 
the usual absolute value | |, namely | |* when 0 < a < 1 (see Exercise 8). 

We sometimes let | |. denote the usual absolute value. This is only a 
notational convention, and is not meant to imply any direct relationship 
between | |. and | |,. 

By the “trivial” norm we mean the norm || || such that |0]| = 0 and 
|x|] = 1 for x 4 0. 


Theorem 1 (Ostrowski). Every nontrivial norm || || on Q is equivalent to | |, 
for some prime p or for p = ©. 


ProoF. Case (i). Suppose there exists a positive integer m such that ||n|| > 1. 
Let no be the least such n. Since ||n|| > 1, there exists a positive real number 
« such that ||n|| = 1.%. Now write any positive integer n to the base no, i.e., 
in the form 


N = Ay + GyMo + Agno? +--+ + ato’, WhereO < a, < nm and a, # 0. 
Then 


[nl] < [aol] + llasrtol] + |laarto?|] +--+ + llasro*l] 
= |[aol] + llarl]-70% + |laal] -707% +--+ + |layl]- 70%. 


I p-adic numbers 


Since all of the a, are <mo, by our choice of my we have ||a,|| < 1, and hence 
I|72|| <sl+am%*+ No?” tees + ny 
= no(1 + mo ~* + m9 2% +--+ + 15%) 


n*| > Une), 


1=0 


IA 


because n = n°. The expression in brackets is a finite constant, which we 
call C. Thus, 


|7|| < Cn* for allm = 1, 2,3,.... 


Now take any n and any large N, and put 7” in place of n in the above 
inequality; then take Nth roots. You get 


In| < VCne. 


Letting N — oo for n fixed gives ||n|| < n*. 

We can get the inequality the other way as follows. If m is written to the 
base my as before, we have n}*+ > n > no’. Since |n}**|| = ||n + n§t? — nl < 
|||] + ||n§*+ — nll, we have 


In|] = |mo**| — |ng** — nl] 
> n@t be a2 (n+ ee n)*, 


since ||§*|| = ||mo||S*1, and we can use the first inequality (i.e., ||n|| < n*) 
on the term that is being subtracted. Thus, 


|||] = nGtP* — (n§*2 — no’)* (since n = no’) 


1 \e« 
= nigel = (1 ra +) 
No 
= C'n 
for some constant C’ which may depend on n, and « but not on x. As before, 
we now use this inequality for n¥, take Nth roots, and let N — oo, finally 
getting: ||n|| > n°. 

Thus, ||n|| = n*%. It easily follows from Property (2) of norms that |x|] = 
|x|? for all x € Q. In view of Exercise 8 below, which says that such a norm is 
equivalent to the absolute value | |, this concludes the proof of the theorem 
in Case (i). 

Case (ii). Suppose that ||n|| < 1 for all positive integers n. Let mo be the 
least n such that ||| < 1; mo exists because we have assumed that || || is 
nontrivial. 

No must be a prime, because if np = 1,-nN2 with n, and nz both <no, then 
7a] = ||n2l| = 1, and so ol] = ||n1|]- [lal] = 1. So let p denote the prime mo. 

We claim that ||q|| = 1 if q is a prime not equal to p. Suppose not; then 
\|q|| < 1, and for some large N we have ||q*|| = |lg||% < 4. Also, for some 
large M we have ||p™|| < 4. Since p™ and q” are relatively prime—have no 
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common divisor other than 1—we can find (see Exercise 10) integers n and m 
such that: mp” + nq” = 1. But then 


1 = [1] = llmp™ + ng" < |mp™|| + lng™|| = [nll lp™ | + Mall lle” 


by Properties (2) and (3) in the definition of a norm. But ||ml, ||n|| < 1, so 
that 
1< |p™| + la*l <4+4=1, 


a contradiction. Hence ||q|| = 1. 

We're now virtually done, since any positive integer a can be factored into 
prime divisors: @ = p,°1pq"2---p,. Then |jal| = || p,||>-||pall’2-- lpr’. 
But the only || p,|| which is not equal to 1 will be || p|| if one of the p,’s is p. Its 
corresponding 5, will be ord, a. Hence, if we let p = ||p|| < 1, we have 


Jal = pee 


It is easy to see using Property (2) of a norm that the same formula holds with 
any nonzero rational number x in place of a. In view of Exercise 5 below, 
which says that such a norm is equivalent to | |,, this concludes the proof 
of Ostrowski’s theorem. oO 


Our intuition about distance is based, of course, on the Archimedean 
metric | |... Some properties of the non-Archimedean metrics | |, seem very 
strange at first, and take a while to get used to. Here are two examples. 

For any metric, Property (3): d(x, y) < d(x, z) + d(z, y) is known as 
the “triangle inequality,” because in the case of the field C of complex 
numbers (with metric d(a + bi,c + di) =V(a — c)? + (b — d)*) it says 
that in the complex plane the sum of two sides of a triangle is greater than 
the third side. (See the diagram.) 


d(x, z) 
d(z, y) 


d(x, y) y 


Let’s see what happens with a non-Archimedean norm on a field F. For 
simplicity suppose z = 0. Then the non-Archimedean triangle inequality says: 
|x — yl] < max(||x]], ||y|]). Suppose first that the “sides” x and y have 
different “length,” say ||x|| < ||y||. The third side x — y has length 


|x — yl < [yl 
But 
|yll = lx — @ — y)| = max(|x], lx — yI). 
Since || y|| is not < ||x|], we must have || y|| < ||x — y|], andso || y|| = ||x — yl. 
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Thus, if our two sides x and y are not equal in length, the longer of the two 
must have the same length as the third side. Every “triangle” is isosceles! 

This really shouldn’t be too surprising if we think what this says in the 
case of | |, on Q. It says that, if two rational numbers are divisible by 
different powers of p, then their difference is divisible precisely by the lower 
power of p (which is what it means to be the same “size” as the bigger of 
the two). 

This basic property of a non-Archimedean field—that ||x + yl < 
max(||x||, |||), with equality holding if ||x|] 4 ||,y||—will be referred to as 
the ‘isosceles triangle principle” from now on. 

As a second example, we define the (open) disc of radius r (r is a positive 
real number) with center a (a is an element in the field F) to be 


D(a, r-) = {xe F| |x -— all < r}. 


Suppose || || is a non-Archimedean norm. Let b be any element in D(a, r~). 
Then 
D(a, r-) = D(b, r-), 
i.e., every point in the disc is a center! Why is this? Well 
xe D(a,r-) > |x -—all <r 
=> |x — db] = |@-a)+@-d)I 
< max(|x — all, la — d]/) 
<r 
=> xe D(b,r-), 
and the reverse implication is proved in the exact same way. 
If we define the closed disc of radius r with center a to be 
D(a, r) = {xe F | |x -— all <r}, 


for non-Archimedean || || we similarly find that every point in D(a, r) is a 
center. 
EXERCISES 

1. For any norm || || on a field F, prove that addition, multiplication, and 


finding the additive and multiplicative inverses are continuous. This means 
that: (1) for any x,ye@F and any « > 0, there exists 5 > 0 such that 
|x’ — xl] < 8 and ly’ — yl < 8 imply |\(x’ + »’) — & + y)I| < e; @) the 
same statement with ||(x’ + »’) — (x + y)|| replaced by ||x’y’ — xy||; (3) for 
any nonzero x € Fand any e > 0, there exists 5 > 0 such that |x’ — x|| < 6 
implies ||(1/x’) — (1/x)|| < e; (4) for any xe F and any e > 0, there exists 
& > O such that ||x’ — x|| < 5 implies |(—x’) — (—x)|| < «. 

2. Prove that if || || is any norm on a field F, then ||—1|| = ||1|| = 1. Prove that 
if || || is non-Archimedean, then for any integer n: ||n|| < 1. (Here “n” 
means the result of adding 1 + 1 + 1 +---+ 1 together m times in the 
field F.) 


3. 


10. 


11. 


12. 
13. 
14, 


15. 


Exercises 


Prove that, conversely, if || || is a norm such that ||n|| < 1 for every integer 7, 
then || || is non-Archimedean. 


. Prove that a norm || || on a field F is non-Archimedean if and only if 


{x€ F| |x| < I} N{xeF| |x — 1] <1} =. 


. Let {| ||: and || [|2 be two norms on a field F. Prove that || ||, ~ || ll2 if and 


only if there exists a positive real number « such that: |x|]; = ||x|2% for 
all xe F. 


. Prove that, if 0 < p < 1, then the function on xe Q defined as p%¢»* if 


x # Oand 0 if x = 0, is a non-Archimedean norm. Note that by the previous 
problem it is equivalent to | |,. What happens if p = 1? What about if p > 1? 


. Prove that | |p, is not equivalent to | |,, if p: and po are different primes. 


. For x € Q define |x|| = |x|* for a fixed positive number «, where | | is the 


usual absolute value. Show that || || is a norm if and only if @ < 1, and that 
in that case it is equivalent to the norm | |. 


. Prove that two equivalent norms on a field F are either both non-Archimedean 


or both Archimedean. 


Prove that, if N and M are relatively prime integers, then there exist integers 
nand m such that nN + mM = 1. 


Evaluate: 
(i) ordg 54 (ii) ord, 128 (iii) ords 57 

(iv) ord7(— 700/197) (v) ord,(128/7) (vi) ords(7/9) 

(vii) ords(— 0.0625) (viii) ord3(10°) (ix) ord3(— 13.23) 
(x) ord,(— 13.23) (xi) ords(— 13.23) (xii) ord,,(— 13.23) 


(xiii) ordig(— 26/169) — (xiv) ordios(— 1/309) = (xv) ord3(9!) 
Prove that ord,((p¥)!) = 1 + p + p? +-++ + pN7}. 
If 0 < a < p — 1, prove that: ord,((ap”)!) = a(1 + p + p? +--+ + pY¥~). 


Prove that, if nm = ao + aip + agp? + --- + asp® is written to the base p, 
so that 0 < a, < p — 1, and if we set S, = > a, (the sum of the digits to the 
base p), then we have the formula: 


ord,(n!) == a 

Evaluate |a — b|», i.e., the p-adic distance between a and 5, when: 

(i) a= 1,5 = 26,p=5 (ii) a = 1,5 = 26,p = © 
(iii) a = 1,6 = 26,p = 3 (iv) a = 1/9,b = —-1/16,p = 5 
(v) a= 1,5 = 244, p = 3 (vi) a = 1,b = 1/244, p = 3 
(vii) a = 1,b = 1/243, p = 3 (viii) a = 1,5 = 183, p = 13 
(ix) a = 1,5 = 183,p = 7 (x) a = 1,5 = 183,p = 2 
(xi) a = 1,56 = 183, p = «© (xii) a = 9!,6 = 0, p = 3 


(xiii) a = (9!)?/3°, b = 0, p = 3 (xiv) a = 27°/2",b = 0,p = 2 
(xv) a = 22%/(2%)!, b = 0, p = 2. 
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16. Say in words what it means for a rational number x to satisfy |x|, < 1. 


17. For x e€Q, prove that lim,..|x'‘/i!|, = 0 if and only if: ord, x => 1 when 
p # 2, ordg x = 2 when p = 2. 


18. Let x be a nonzero rational number. Prove that the product over al/ primes 
including © of |x|, equals 1. (Notice that this “infinite product” actually 
only includes a finite number of terms that are not equal to 1.) Symbolically, 


Tp |xlp = 1. 


19. Prove that for any p (4 ©), any sequence of integers has a subsequence which 
is Cauchy with respect to | |p. 


20. Prove that if x € @ and |x|, < 1 for every prime p, then x € Z. 


3. Review of building up the complex 
numbers 


We now have a new concept of distance between two rational numbers: two 
rational numbers are considered to be close if their difference is divisible by 
a large power of a fixed prime p. In order to work with this so-called “‘ p-adic 
metric”? we must enlarge the rational number field Q in a way analogous 
to how the real numbers R and then the complex numbers C were constructed 
in the classical Archimedean metric | |. So let’s review how this was done. 

Let’s go back even farther, logically and historically, than Q. Let’s go back 
to the natural numbers N = {1, 2, 3, ...}. Every step in going from N to C 
can be analyzed in terms of a desire to do two things: 


(1) Solve polynomial equations. 

(2) Find limits of Cauchy sequences, i.e., “complete” the number system to 
one “without holes,” in which every Cauchy sequence has a limit in 
the new number system. 


First of all, the integers Z (including 0, —1, —2, ...) can be introduced as 
solutions of equations of the form 


a+x=5, a, beN. 


Next, rational numbers can be introduced as solutions of equations of the 
form 
ax =}, a, beZ. 


So far we haven’t used any concept of distance. 

One of the possible ways to give a careful definition of the real numbers is 
to consider the set S of Cauchy sequences of rational numbers. Call two 
Cauchy sequences s, = {a,} € Sand s, = {b,} e S equivalent, and write s, ~ s2, 
if ja, — b,| > 0 as j > 00. This is obviously an equivalence relation, that is, 
we have: (1) any s is equivalent to itself; (2) if 5; ~ sa, then s. ~ s,; and 
(3) if 5, ~ sg and sq ~ Sa, then s; ~ 53. We then define R to be the set of 
equivalence classes of Cauchy sequences of rational numbers. It is not hard 
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to define addition, multiplication, and finding additive and multiplicative 
inverses of equivalence classes of Cauchy sequences, and to show that R is a 
field. Even though this definition seems rather abstract and cumbersome at 
first glance, it turns out that it gives no more nor less than the old-fashioned 
real number line, which is so easy to visualize. 

Something similar will happen when we work with | |, instead of | |: 
starting with an abstract definition of the p-adic completion of Q, we'll get a 
very down-to-earth number system, which we'll call Q,. 

Getting back to our historical survey, we’ve gotten as far as R. Next, 
returning to the first method—solving equations—mathematicians decided 
that it would be a good idea to have numbers that could solve equations like 
x? + 1=0. (This is taking things in logical order; historically speaking, 
the definition of the complex numbers came before the rigorous definition 
of the real numbers in terms of Cauchy sequences.) Then an amazing thing 
happened! As soon as i = V—1 was introduced and the field of complex 
numbers of the form a + bi, a, be R, was defined, it turned out that: 


(1) All polynomial equations with coefficients in C have solutions in C—this 
is the famous Fundamental Theorem of Algebra (the concise terminology 
is to say that C is algebraically closed); and 

(2) Cis already ‘“‘complete” with respect to the (unique) norm which extends 
the norm | | on R (this norm is given by |a + bi] = Va? + 5), ie., any 
Cauchy sequence {a, + 5,i} has a limit of the form a + bi (since {a,;} and 
{b,} will each be Cauchy sequences in R, you just let a and b be their 
limits). 


So the process stops with C, which is only a “‘ quadratic extension” of R 
(i.e., obtained by adjoining a solution of the quadratic equation x? + 1 = 0). 
C is an algebraically closed field which is complete with respect to the Archime- 
dean metric. 

But alas! Such is not to be the case with | |,. After getting Q,, the comple- 
tion of Q with respect to | |,, we must then form an infinite sequence of 
field extensions obtained by adjoining solutions to higher degree (not just 
quadratic) equations. Even worse, the resulting algebraically closed field, 
which we denote Q,, is not complete. So we take this already gigantic field 
and “‘fill in the holes” to get a still larger field Q. 

What happens then? Do we now have to enlarge © to be able to solve 
polynomial equations with coefficients in Q? Does this process continue on 
and on, in a frightening spiral of ever more far-fetched abstractions? Well, 
fortunately, with Q the guardian angel of p-adic analysis intervenes, and it 
turns out that Q is already algebraically closed, as well as complete, and our 
search for the non-Archimedean analogue of C is ended. 

But this Q, which will be the convenient number system in which to study 
the p-adic analogy of calculus and analysis, is much less thoroughly 
understood than C. As I. M. Gel’fand has remarked, some of the simplest 
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questions, e.g., characterizing Q,-linear field automorphisms of 2, remain 
unanswered. 
So let’s begin our journey to 2. 


4. The field of p-adic numbers 


For the rest of this chapter, we fix a prime number p # ©o. 

Let S be the set of sequences {a,} of rational numbers such that, given 
e > 0, there exists an N such that |a, — a,|, < e if both i, i’ > N. We call 
two such Cauchy sequences {a,} and {b,} equivalent if |a, — b,|,->0 as 
i—> oo. We define the set Q, to be the set of equivalence classes of Cauchy 
sequences. 

For any x € Q, let {x} denote the “‘constant’’ Cauchy sequence all of whose 
terms equal x. It is obvious that {x} ~ {x’} if and only if x = x’. The equiva- 
lence class of {0} is denoted simply by 0. 

We define the norm | |, of an equivalence class a to be lim, |@|p, 
where {a,} is any representative of a. The limit exists because 


(1) If a = 0, then by definition lim,.., [a,|, = 0. 
(2) If a # 0, then for some « and for every N there exists an iy > N with 
laiylp > 


If we choose N large enough so that |a, — a,;|, < « when i, i’ > N, we have: 
la — aylp < ¢ foralli> N. 


Since |a,,|, > ¢, it follows by the “isosceles triangle principle” that |a,|, = 
|a:y|p- Thus, for all i > N, |a,|, has the constant value |a,,|,. This constant 
value is then limy.... |a|>. 

One important difference with the process of completing Q to get R should 
be noted. In going from Q to R the possible values of | | =| |. were 
enlarged to include all nonnegative real numbers. But in going from Q@ to Q, 
the possible values of | |, remain the same, namely {p"},<z LU {0}. 

Given two equivalence classes a and b of Cauchy sequences, we choose 
any representatives {a,} € a and {b,} € b, and define a-b to be the equivalence 
class represented by the Cauchy sequence {a,,}. If we had chosen another 
{a,} € a and {b,'} € b, we would have 


|a,'b,’ — a;bi\p = |a,'(b;’ — 5) + ba’ — a)|p 
< max(|a;'(b;’ — 5)|,, |b(a,’ — a)|p); 


as i-> oo, the first expression approaches |a|,- lim |b,’ — 5,|, = 0, and the 
second expression approaches |5|,-lim|a,’ — a,|, = 0. Hence {a,'b,} ~ {a,b,}. 

We similarly define the sum of two equivalence classes of Cauchy se- 
quences by choosing a Cauchy sequence in each class, defining addition 
term-by-term, and showing that the equivalence class of the sum only 
depends on the equivalence classes of the two summands. Additive inverses 
are also defined in the obvious way. 
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For multiplicative inverses we have to be a little careful because of the 
possibility of zero terms in a Cauchy sequence. However, it is easy to see that 
every Cauchy sequence is equivalent to one with no zero terms (for example, 
if a, = 0, replace a, by a,’ = p'). Then take the sequence {1/a,}. This sequence 
will be Cauchy unless |a;|, — 0, i.e., unless {a,} ~ {0}. Moreover, if {a,} ~ {a,'} 
and no a, or a,’ is zero, then {1/a,} ~ {1/a,'} is easily proved. 

It is now easy to prove that the set Q, of equivalence classes of Cauchy 
sequences is a field with addition, multiplication, and inverses defined as 
above. For example, distributivity: Let {a,}, {b,}, {c,} be representatives of 
a, b,c € Q,; then a(b + c) is the equivalence class of 


{a,(b, + c)} = {ab, + a,c}, 


and ab + ac is also the equivalence class of this sequence. 

Q can be identified with the subfield of Q, consisting of equivalence classes 
containing a constant Cauchy sequence. Under this identification, note that 
| |p on Q, restricts to the usual | |, on Q. 

Finally, it is easy to prove that Q, is complete: if {a,},-1,2,.. is a sequence 
of equivalence classes which is Cauchy in Q,, and if we take representative 
Cauchy sequences of rational numbers {a;;};-1,2,... for each a;, where for 
each j we have |aj; — aj|) < p 4 whenever i, i’ > N,, then it is easily shown 
that the equivalence class of {aj ,}j=1,2,... is the limit of the a;. We leave the 
details to the reader. 

It’s probably a good idea to go through one such tedious construction in 
any course or seminar, so as not to totally forget the axiomatic foundations 
on which everything rests. In this particular case, the abstract approach also 
gives us the chance to compare the p-adic construction with the construction 
of the reals, and see that the procedure is logically the same. However, after 
the following theorem, it would be wise to forget as rapidly as possible 
about “‘equivalence classes of Cauchy sequences,” and to start thinking in 
more concrete terms. 


Theorem 2. Every equivalence class a in Q, for which |a|, < 1 has exactly one 
representative Cauchy sequence of the form {a,} for which: 


(1) 0 < a, < p' fori = 1,2,3,.... 
(2) a = a,., (mod p’) for i = 1,2,3,.... 


ProorF. We first prove uniqueness. If {a,’} is a different sequence satisfying (1) 
and (2), and if a,, # a,,’, then a,, # a,, (mod p's), because both are between 
0 and p‘, But then, for all i > ip, we have a, = a,, # a,’ = a,’ (mod p*), 
i.e., a, # a,’ (mod pc), Thus 

Ja, — a,'|, > 1/pto 


for all i > ig, and {a,} ~ {a,'}. 
So suppose we have a Cauchy sequence {b,}. We want to find an equivalent 
sequence {a,} satisfying (1) and (2). To do this we use a simple lemma. 
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Lemma. [f x € Q and |x|, < 1, then for any i there exists an integer « € Z such 
that |« — x|, < p~‘. The integer « can be chosen in the set {0, 1, 2, 3, ..., 
p' — I}. 


PROOF OF LEMMA. Let x = a/b be written in lowest terms. Since |x|, < 1, 
it follows that p does not divide b, and hence 5 and p‘ are relatively prime. So 
we can find integers m and n such that: mb + np! = 1. Let a = am. The idea 
is that mb differs from 1 by a p-adically small amount, so that m is a good 
approximation to 1/b, and so am is a good approximation to x = a/b. More 
precisely, we have: 


ja — x|, = |am — (a/b)|, = |a/b|, |mb — |, 
< |mb — 1|, = |np'|, = |n\|,/p' < Ip". 


Finally, we can add a multiple of p‘ to the integer « to get an integer between 
0 and p' for which |« — x|, < p~‘ still holds. The lemma is proved. Oo 


Returning to the proof of the theorem, we look at our sequence {b,}, and, 
for every j = 1, 2,3, ..., let N(j) be a natural number such that |b, — b,|, < 
p~’ whenever i, i’ > N(j). (We may take the sequence N(j) to be strictly 
increasing with j; in particular, N(j) = j.) Notice that |b,|, < 1 if i => N(1), 
because for all i’ => N(1) 


lbilp max(|b;-|,, |b, a: by |p) 


max(|b,-|,, 1/p), 
and |b,-|,— |alp < 1 as i’ > 0. 

We now use the lemma to find a sequence of integers a,, where 0 < a; < p,, 
such that 


lA IA 


la; — buy < 1/p’. 


I claim that {a,} is the required sequence. It remains to show that a,;,, = a, 
(mod p’) and that {b,} ~ {a,}. 
The first assertion follows because 
laj41 — Alp = [Aj41 — buyer) + bugsry — Suq — @ — bap 
< max(|aj41 — byg+ale lbwg+1) — Sucle las — Svcnle) 
< max(1/p’*?, 1/p’, 1/p’) 
= 1/p’. 
The second assertion follows because, given any j, for i => N(j) we have 
Ja, — bilp = la — a; + a; — buy — (& — bvep 
< max(|a; — |p, [ay — buys |b: — Bucnly) 
< max(1/p’, 1/p’, 1/p’) 
= I/p’. 
Hence |a, — 5,|, > 0 as i—> 00. The theorem is proved. QO 
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What if our p-adic number a does not satisfy |a|, < 1? Then we can 
multiply a by a power p™ of p (namely, by the power of p which equals |a],), 
to get a p-adic number a’ = ap™ which does satisfy |a’], < 1. Then a’ is 
represented by a sequence {a,’} as in the theorem, and a = a’p~™ is repre- 
sented by the sequence {a,} in which a, = a,'p~™. 

It is now convenient to write all the a,’ in the sequence for a’ to the base p, 
ie., 


a,’ = bo + bp + bop? +--+ + by-yp'?, 


where the b’s are all “digits,” i.e., integers in {0, 1, ..., p — 1}. Our condition 
a,’ = a;,, (mod p‘) precisely means that 


Qi41 = by + bp + bap? +--+ by pi? + bp, 


where the digits b) through b,_, are all the same as for a,’. Thus, a’ can be 
thought of intuitively as a number, written to the base p, which extends 
infinitely far to the right, i.e., we add a new digit each time we pass from a,’ 
to 444. 

Our original a can then be thought of as a base p decimal number which 
has only finitely many digits “‘to the right of the decimal point” (i.e., corres- 
ponding to negative powers of p, but actually written starting from the left) 
but has infinitely many digits for positive powers of p: 


by 
pt 


bm-1 


b 
a=—a+ to FES + Bn + Bm sae + Baap? + °°. 


Here for the time being the expression on the right is only shorthand for the 
sequence {a,}, where a, = bop™™ + --- + b,_,p'~1~™”, that is, a convenient way 
of thinking of the sequence {a,} all at once. We’ll soon see that this equality 
is in a precise sense “real” equality. This equality is called the “‘p-adic 
expansion” of a. 

We let Z, = {ae Q, | |a|, < 1}. This is the set of all numbers in Q, 
whose p-adic expansion involves no negative powers of p. An element of Z, 
is called a “p-adic integer.” (From now on, to avoid confusion, when we 
mean an old-fashioned integer in Z, we'll say “‘rational integer.””) The sum, 
difference, and product of two elements of Z, is in Z,, so Z, is what’s called a 
“‘subring”’ of the field Q,. 

If a, be Q,, we write a = b (mod p") if |a — b|, < p~*, or equivalently, 
(a — b)/p" € Z,, i.e., if the first nonzero digit in the p-adic expansion of a — b 
occurs no sooner than the p"-place. If a and b are not only in Q, but are 
actually in Z (i.e., are rational integers), then this definition agrees with the 
earlier definition of a = b (mod p"). 

We define Z, * as {x €Z, | 1/x €Z,}, or equivalently as {x € Z, | x # 0(modp)}, 
or equivalently as {x eZ, | |x|, = 1}. A p-adic integer in Z,*—i.e., whose 
first digit is nonzero—is sometimes called a “‘p-adic unit.” 
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Now let {5/2 _, be any sequence of p-adic integers. Consider the sum 


y= OSB SOB dt ba + Bap + bap? to + Dap", 
This sequence of partial sums is clearly Cauchy: if M > N, then |Sy — Sy], 
< 1/p’. It therefore converges to an element in @,. As in the case of infinite 
series of real numbers, we define 5/2 _, b,p' to be this limit in Q,. 

More generally, if {c,} is any sequence of p-adic numbers such that |c,|, > 0 
as i —> 00, the sequence of partial sums Sy = c, + cz +--+ +cy converges to 
a limit, which we denote 72, ¢. This is because: |Sy — Sylp = 
lever + Cnsa toes + Culp S max(|cy+ily, |Cvs+alps**-> l¢ul|p) which — 0 as 
N — oo. Thus, p-adic infinite series are easier to check for convergence than 
infinite series of real numbers. A series converges in Q, if and only if its terms 
approach zero. There is nothing like the harmonic series 1 + 4+4+4+-:- 
of real numbers, which diverges even though its terms approach 0. Recall 
that the reason for this is that | |, of a sum is bounded by the maximum 
(rather than just the sum) of the| |, of the summands when p # ©, i.e., when 
| |p is non-Archimedean. 

Returning now to p-adic expansions, we see that the infinite series on the 
right in the definition of the p-adic expansion 


eee Host PBL By + BmasD + Bmesd? + >>> 


(here 5, € {0, 1,2, ..., p — 1}) converges to a, and so the equality can be 
taken in the sense of the sum of an infinite series. 

Note that the uniqueness assertion in Theorem 2 is something we don’t 
have in the Archimedean case. Namely, terminating decimals can also be 
represented by decimals with repeating 9s: 1 = 0.9999.---. But if two p-adic 
expansions converge to the same number in Q,, then they are the same, i.e., 
all of their digits are the same. 

One final remark. Instead of {0, 1, 2, ..., — 1} we could have chosen 
any other set S = {a, 1, a, ..., &p-3} Of p-adic integers having the property 
that «, = i(mod p) for i = 0, 1,2, ...,p — 1, and could then have defined 
our p-adic expansion to be of the form >. _,, bp', where now the “digits” 5, 
are in the set S rather than in the set {0, 1, ..., p — 1}. For most purposes, 
the set {0, 1, ..., p — 1} is the most convenient. But there is another set S, 
the so-called ‘‘Teichmiiller representatives” (see Exercise 13 below), which 
is in some ways an even more natural choice. 


5. Arithmetic in Q, 


The mechanics of adding, subtracting, multiplying, and dividing p-adic 
numbers is very much like the corresponding operations on decimals which 
we learn to do in about the third grade. The only difference is that the 
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“carrying,” “‘borrowing,” “long multiplication,” etc. go from left to right 
rather than right to left. Here are a few examples in Q,: 


3+6x7+2x PH+--- 2x7414+0x74+3xTF+-:-- 
x445x741x P+. —-4x7346xP+5x P+. 
544x744x P+ 5x7 +0xP+4xPp 
1x7+4x 7PH+--- 
3x P+--- 


5+5x7+4x P+--- 


5+1x7+6x 7?4+--- 
34+5x74+1x 74+---/PF2x7+4x P+ 
1+6x74+1x 7+--- 
3x7+2x« Ft... 
3x74+5x P+--- 
4x 7274... 
4x 724... 


As another example, let’s try to extract /6 in Q,, i.e., we want to find 
Ao, 41, Ag, ...,0 < a, < 4, such that 


(a9 +a,x 5+ a,x 2+---PHalt+ix 5. 


Comparing coefficients of 1 = 5° on both sides gives a)? = 1 (mod 5), and 
hence a, = | or 4. Let’s take ay = 1. Then comparing coefficients of 5 on 
both sides gives 2a, x 5 = 1 x 5 (mod 52), so that 2a, = 1 (mod 5), and 
hence a, = 3. At the next step we have: 


l+1xS5e2(+3 St+a,~x 5%? =14+1x 54+ 2a, x 57 (mod 59). 
Hence 2a, = 0 (mod 5), and a, = 0. Proceeding in this way, we get a series 
a=1+3x5+0x 5?7+4x S+a,x S¢+as x 5S +--- 


where each a, after ay is uniquely determined. 
But remember that we had two choices for aj, namely 1 and 4. What if we 
had chosen 4 instead of 1? We would have gotten 


-a=44+1x5+4~x 54+0x 5? 
+ (4 — a) x 54+ (4 — a) x 58+ 


The fact that we had two choices for ao, and then, once we chose ao, only a 
single possibility for a,, a2, a3, ..., merely reflects the fact that a nonzero 
element in a field like Q or R or Q, always has exactly two square roots in the 
field if it has any. 

Do all numbers in Q; have square roots? We saw that 6 does, what about 
7? If we had 


(Qa +a, x54+---2?=24+1x 5, 
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it would follow that a)? = 2 (mod 5). But this is impossible, as we see by 
checking the possible values a) = 0, 1, 2, 3,4. For a more systematic look 
at square roots in Q,, see Exercises 6-12. 

This method of solving the equation x? — 6 = 0 in Q;—by solving the 
congruence a)? — 6 = 0 (mod 5) and then solving for the remaining a; in a 
step-by-step fashion—is actuaily quite general, as shown by the following 
important “lemma.” This form of the lemma was apparently first given in 
Serge Lang’s Ph.D. thesis in 1952 (Annals of Mathematics, Vol. 55, p. 380). 


Theorem 3 (Hensel’s lemma). Let F(x) = co + (:X +--+: + Cx" be a poly- 
nomial whose coefficients are p-adic integers. Let F'(x) = cy + 2cox + 
3cgx? +--+ + ne,x"—} be the derivative of F(x). Let ag be a p-adic integer 
such that F(ay) = 0 (mod p) and F'(a)) # 0 (mod p). Then there exists a 
unique p-adic integer a such that 


F(a) =0 and a= a) (mod p). 


(Note: In the special case treated above, we had F(x) = x? — 6, F'(x) = 
2x, a= 1.) 


PROOF OF HENSEL’S LEMMA. I claim that there exists a unique sequence 
of rational integers a,, ao, a3, ... such that for alln > 1: 
(1) F(a,) = 0 (mod p**?). 
(2) a, = a,-1 (mod p"). 
(3) 0 < a, < p**?. 


We prove that such a, exist and are unique by induction on n. 

If n = 1, first let @ be the unique integer in {0, 1, ..., p — 1} which is 
congruent to a) mod p. Any a, satisfying (2) and (3) must be of the form 
d, + bp, where0 < b, < p — 1. Now, looking at F(@ + bp), we expand the 
polynomial, remembering that we only need congruence to 0 mod p?, so 
that any terms divisible by p? may be ignored: 


F(a,) = F(@ + bip) = > ci(@> + bi p)' 

= > (co! + ic,a@i-1b,p + terms divisible by p?) 

= > ado! + (> ica *)b,p (mod p?) 

= F(@) + F'(G)bip. 
(Note the similarity to the first order Taylor series approximation in calculus: 
F(x + h) = F(x) + F’(x)h + higher order terms.) Since F(a.) = 0 (mod p) 
by assumption, we can write F(d)) = «p (mod p?) for some a €{0, 1, ..., p — 1}. 
So in order to get F(a,) = 0(mod p”) we must get ap + F'(&)bip = 0 
(mod p?), i.e., « + F’(a)b, = 0(mod p). But, since F’(a)) # 0 (mod p) by 
assumption, this equation can always be solved for the unknown b,. Namely, 
using the lemma in the proof of Theorem 2, we choose b, € {0, 1, ...,p — 
so that A, = —a/F’(G@)(modp). Clearly this b,€{0,1,...,p — 1} is 
uniquely determined by this condition. 
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Now, to proceed with the induction, suppose we already have ay, az, ..., 
a,_,. We want to find a,. By (2) and (3), we need a, = a@,-,; + b,p" with 
b, €{0, 1, ...,p — 1}. We expand F(a,_, + 5,p") as we did before in the 
case n = I, only this time we ignore terms divisible by p**1. This gives us: 


F(a,) = F(a,-1 + b,p") = F(a, -1) + F'(@,-1)bnp" (mod p**?). 


Since F(a,-,) = 0(mod p") by the induction assumption, we can write 
F(a@,-1) = «'p" (mod p"*?), and our desired condition F(a,) = 0 (mod p**?) 
now becomes 

a'p™ + F'(a,-1)b,p” = 0 (mod p***),  ie., a’ + F’(a,_-1)b, = 0 (mod p). 


Now, since a,_; = @ (mod p), it easily follows that F’(a,_,) = F'(ao) # 0 
(mod p), and we can find the required 5, € {0, 1, ...,p — 1} proceeding 
exactly as in the case of 5,, ie., solving b, = —a'/F’(a,-,) (mod p). This 
completes the induction step, and hence the proof of the claim. 

The theorem follows immediately from the claim. We merely let a = 
> + bip + bop? + --- .Since for alln we have F(a) = F(a,) = 0 (mod p"*?), 
it follows that the p-adic number F(a) must be 0. Conversely, any a = d + 
bp + bap? + --- gives a sequence of a, as in the claim, and the uniqueness 
of that sequence implies the uniqueness of the a. Hensel’s lemma is proved. 1 


Hensel’s lemma is often called the p-adic Newton’s lemma because the 
approximation technique used to prove it is essentially the same as Newton’s 


Figure I.1. Newton’s method in the real case 
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method for finding a real root of a polynomial equation with real coefficients. 
In Newton’s method in the real case, (see Figure I.1), if f’(a,-1) # 0, we take 
sgeeot st flGusa): 

nee f ‘(Gn - A) 
The correction term —/(a,_1)/f'(@n-1) is a lot like the formula for the 
“correction term” in the proof of Hensel’s lemma: 


a, = 


a = a‘p" — _ F@n-1) +1 
LP SGD EG 


In one respect the p-adic Newton’s method (Hensel’s lemma) is much 
better than Newton’s method in the real case. In the p-adic case, it’s guaranteed 
to converge to a root of the polynomial. In the real case, Newton’s method 
usually converges, but not always. For example, if you take f(x) = x° — x 
and make the unfortunate choice a) = 1/5, you get: 

a, = V5 — [1/505 — 1/V'5)/3/5 — 1) 
= 1/V5[1 — (1/5 — 19/5 — 1)] = -1/V5; 
a, = 1/V5; a3 = —1/V5, ete. 
(See Figure I.2.) Such perverse silliness is impossible in Q,. 


Figure I.2, Failure of Newton’s method in the real case 
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EXERCISES 


1. 


2. 


10. 
11. 


12. 


If aeQ, has p-adic expansion a-mp~" + a_-msip~™t! + +++ + ao + 
ap + ---, what is the p-adic expansion of —a? 


Find the p-adic expansion of: 


(Gi) (6+4x74+2x P+1x F+---)\3B+0xK7+0x 74+6x 7+---) 
in Q, to 4 digits 

Gi) 1/3 +2x 5+3 x 5274+ 1 x 53 +.---)in Qs to 4 digits 

(iii) 9 x 112 — (3 x 11724241 x 11743 x 112 4+---) in Quy to 4 


digits 
(iv) 2/3 in Q2 (v) —1/6in Q, (vi) 1/10 in Qy, 
(vii) —9/16 in Qy3 (viii) 1/1000 in Q; (ix) 6! in Qs 
(x) 1/3! in Q3 (xi) 1/4! in Q, (xii) 1/5! in Q, 


Prove that the p-adic expansion of a nonzero a € Q, terminates (i.e., a; = 0 for all 
i greater than some WV) if and only if a is a positive rational number whose 
denominator is a power of p. 


. Prove that the p-adic expansion of a € Q, has repeating digits from some point 


on (i.e., 444, = a; for some r and for all i greater than some JN) if and only 
ifae Q. 


. What is the cardinality of Z,? Prove your answer. 


. Prove the following generalization of Hensel’s lemma: Let F(x) be a poly- 


nomial with coefficients in Z,. If aj éZ, satisfies F’(ao)= 0 (mod p™) but 
F’(ao) = 0 (mod p™*?), and if F(ao) = 0 (mod p?™ *?), then there is a unique 
aeéZ, such that F(a) = 0 and a = ap (mod p™*?), 


. Use your proof in Exercise 6 to find a square root of —7 in Qg to 5 digits. 


. Which of the following 11-adic numbers have square roots in Q,,? 


(i) 5 (ii) 7 (iii) —7 
(iv) 5+ 3x 114+ 9x 1174+ 1x 118 

(vy) 3 x 1172+ 6x 11-'+3+0x 11+7-x 11? 
(vi) 3 x 117°74+64+3x 11+0x 1174+7 x 118 
(vii) 1 x 117 (viii) 7 — 6 x 11? 
(ix) 5 x 1172 + Sion x 11". 


. Compute +,/—1 in Q, and +,/—3 in Q, to 4 digits. 


For which p = 2, 3, 5,7, 11, 13, 17, 19 does —1 have a square root in Q,? 


Let p be any prime besides 2. Suppose « € Q, and |a|, = 1. Describe a test 
for whether a@ has a square root in Q,. What about if |a|, # 1? Prove that 
there exist four numbers «;, a, «3, e4, € Q, such that for all nonzero «a € Q, 
exactly one of the numbers a1, 2a, aga, a4 has a square root. (In the case 
when p is replaced by © and Q, by R, there are two numbers, for example 
+1 will do, such that for every nonzero a € R exactly one of the numbers 
1-c@ and —1-a has a square root in R.) 


The same as Exercise 1! when p = 2, except that now there will be eight 
numbers o, ..., %3 € Q@z such that for all nonzero a € @2 exactly one of the 
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numbers o,a, ...,@g« has a square root in Qg. Find such a, ..., ag (the 
choice of them is not unique, of course). 


13. Find all 4 fourth roots of 1 in Qs to four digits. Prove that Q, always contains 
Psolutions ao, a1, ..., @p-1 to the equation x” — x = 0, wherea, = i (mod p). 
These p numbers are called the ‘‘ Teichmiiller representatives” of {0, 1,2, ..., 
p — 1} and are sometimes used as a set of p-adic digits instead of 
{0, 1, 2,...,p — 1}. If p > 2, which Teichmiiller representatives are rational? 


14. Prove the following “Eisenstein irreducibility criterion” for a polynomial 
L(x) = ao + ax +--+ + anx" with coefficients a; € Z,: If a; = 0 (mod p) for 
i=0,1,2,...,” —1, if a, 40(modp), and if a. 4 0 (mod p?), then f(x) 
is irreducible over Q,, i.e., it cannot be written as a product of two lower 
degree polynomials with coefficients in Q,. 


15. If p > 2, use Exercise 14 to show that 1 has no pth root other than | in Q,. Prove 
that if p > 2, then the only roots of | in Q, are the nonzero Teichmiiller represen- 
tatives; and in Q, the only roots of | are +1. 


16. Prove that the infinite sum 1 + p + p? + p® + --- converges to 1/(1 — p) in 
Q,. What about 1 — p + p? — p® + p* — p> + --- ? What about 1 + 
(p — 1)p + p? + (p — 1)p? + p* + (p —- 1)pP +---? 


17. Show that (a) every element x € Z, has a unique expansion of the form x = dg + 
a,(—p) + a,(—p)? +--+. + a,(—p)" + ---, with a; € {0, 1,..., p — 1}, and (b) this 
expansion terminates if and only if x € Z. 


18. Suppose that 7 is a (positive or negative) integer not divisible by p, and let 
« = 1 (mod p). Show that « has an vth root in Q,. Give a counter-example 
if n = p. Show that « has a pth root if « = 1 (mod p?) and p # 2. 


19, Let «€Z,. Prove that a?” = a?”~* (mod p”) for M = 1, 2, 3,4, .... Prove 
that the sequence {«?”} approaches a limit in Q,, and that this limit is the 
Teichmiiller representative congruent to « mod p. 


20. Prove that Z, is sequentially compact, i.e., every sequence of p-adic integers 
has a convergent subsequence. 


21. Define matrices with entries in Q,, their sums, products, and determinants 
exactly as in the case of the reals. Let M = {r x r matrices with entries in Z,}, 
let M* = {Ae M]|A has an inverse in M} (it’s not hard to see that this is 
equivalent to: det AeZ,”*), and let pM = {AE M|A = pB with Be M}. 
If Ae M* and Be pM, prove that there exists a unique X € M” such that: 
X?-—AX+B=0. 
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CHAPTER II 


p-adic interpolation of the Riemann 
zeta-function 


This chapter is logically independent of the following chapters, and is pre- 
sented at this point in the middle of our ascent to Q as a plateau in the level 
of abstraction—namely, everything in this chapter still takes place in the 
fields Q, Q,, and R. 

The Riemann ¢-function is defined as a function of real numbers greater 
than 1 by 


— 1 
US) Da 


It is easy to see (by comparison with the integral ‘Pa (dx/x*) = 1/(s — 1) for 
fixed s > 1) that this sum converges when s > 1. 

Let p be any prime number. The purpose of this chapter is to show that 
the numbers (2k) for k = 1, 2,3, ... have a “p-adic continuity property.” 
More precisely, consider the set of numbers 


fk) = (1 — p%*-*) Se 42k), where c, = (— 1)" PEED, 


as 2k runs through all positive even integers in the same congruence class 
mod(p — 1). It turns out that /(2k) is always a rational number. Moreover, 
if two such values of 2k are close p-adically (i.e., their difference is divisible 
by a high power of p), then we shall see that the corresponding f(2k) are also 
p-adically close. (We must also assume that 2k is not divisible by p — 1.) 
This means that the function fcan be extended in a unique way from integers 
to p-adic integers so that the resulting function is a continuous function of a 
p-adic variable with values in Q,. (“Continuous function” means, as in the 
real case, that whenever a sequence of p-adic integers {x,} approaches x 
p-adically, {f(x,)} approaches f(x) p-adically.) 
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This is what is meant by p-adic “‘interpolation.” The process is analogous 
to the classical procedure for, say, defining the function f(x) = a* (where a 
is a fixed positive real number): first define f(x) for fractional x; then prove 
that nearby fractional values of x give nearby values of a*; and, finally, 
define a* for x irrational to be the limit of a*» for any sequence of rational 
numbers x, which approach x. 

Notice that a function fon the set S of, for example, positive even integers 
can be extended in at most one way to a continuous function on Z, (assume 
p # 2). This is because S is “‘dense” in Z,—any x € Z, can be written as a 
limit of positive even integers x,. If f is to be continuous, we must have 
S(x) = lim, f(%,). In the real case, while the rational numbers are dense 
in R, the set S is not. It makes no sense to talk of “the” continuous real- 
valued function which interpolates a function on the positive even integers; 
there are always infinitely many such functions. (However, there might be a 
unique real-valued continuous interpolating function which has additional 
convenient properties: for example, the gamma-function I'(x + 1) inter- 
polates k ! when x = k is a nonnegative integer, it satisfies '(x + 1) = xI'(x) 
for all real x, and its logarithm is a convex function for x > 0; the gamma- 
function is uniquely characterized by these properties.) 


1. A formula for ¢(2k) 


The kth Bernoulli number B,, is defined as k! times the kth coefficient in the 
Taylor series for 
et eo ae 
eo— 1 b+ 4/2! + 27/3! 4+ 5/4l4+--- + O72 + DI +--- 
HD, Beth/kl. 
k=0 


The first few B, are: 
By = 1, B, = -1/2, B, = 1/6, B; = 0, 
B, = —1/30, B; = 0, Bo = 1/42, .... 


We now derive the formula: 


rOb) = (~ Df en (Be) fork = 1.2.3 
(2k — 1)! 2k ls Dyreea ts 


Recall the definition of the “hyperbolic sine,” abbreviated sinh (and 
pronounced ‘“‘sinch’’): 


. ~ ex 
sinh x = 5) 
It is equal to its Taylor series 
P x3 xs xrkt 
Stab ety gy oo oe yy fey 
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obtained by averaging the series for e* and —e~*. Note that this Taylor 
series is the same as that for sin x, except without the alternation of sign. 


Proposition. For all real numbers x, the infinite product 
CJ 2 
wx] ] (1 + *) 
n=1 n 
converges and equals sinh(7x). 


Proor. Convergence of the infinite product is immediate from the logarithm 


test: 
2 2 
lo (1 4 =) 
2 g nm 


We start by deriving the infinite product for sin x. 


@ x2 
< > — <0 forall x. 
nat 


Lemma. Let n = 2k + 1 be a positive odd integer. Then we can write 
sin(nx) = P, (sin x) 
cos(nx) = cos x Q,-, (sin x) 
where P,, (respectively Q,,- 1) is a polynomial of degree at most n (respectively 


n — 1) with integer coefficients. 


PROOF OF LEMMA. We use induction on k. The lemma is trivial for k = 0 
(i.e., 2 = 1). Suppose it holds for k — 1. Then 
sin[(2k + 1)x] = sin[(2k — 1)x + 2x] 
sin(2k — 1)x cos 2x + cos(2k — 1)x sin 2x 
= P.,-, (sin x\(1 — 2 sin? x) 
+ cos XQo,-2 (sin x)2 sin x cos x, 


Ml 


i] 


which is of the required form P.,,, (sin x). The proof that cos(2k + 1)x = 
cos x Qo, (sin x) is completely similar, and will be left to the reader. oO 


We now return to the proof of the proposition. Notice that, if we set 
x = 0 in sinnx = P, (sin x), we find that P, has constant term zero. Next, 
we take the derivative with respect to x of both sides of sin nx = P, (sin x): 


ncos nx = P,’(sin x) cos x. 


Setting x = 0 here gives: n = P,,'(0), ie., the first coefficient of P,, is n. Thus, 
we may write: 


sin nx 


—— = P,,(sin x) = 1 + a, sinx + a.sin?x +--- 
ei ax( ) + ay, + az + 


+ dg,sin**x (n= 2k + 1), 
where the a, are rational numbers. Note that for x = +(m/n), ..., +(kz/n), 


the left-hand side vanishes. But the 2k values y = +sin(a/n), +sin(2z/n), ..., 
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+sin(kz/n) are distinct numbers at which the polynomial P,,(y) vanishes. 
Since P,, has degree 2k and constant term 1, we must have: 


Py) = (1 - soon) (} 7 xian) ( - singe) - ter) 
-(1~ sabe) (- =atem) 


=I ('~ step) 


Thus 


. k bem} 
sinnx -. . sin? x 
—— = P2,{sin x) = l-- . 
fae El ( sin? =) 


Replacing x by x/n gives: 


sin 7x =11(1 (1 _ Seen) ; 
nsin(rx/n) +4 sin?(ar/n) 
Now take the limit of both sides as n = 2k + 1-> 00. The left-hand side 
approaches (sin 7x)/mx. For r small relative to n the rth term in the product 
approaches 1 — ((mx/n)/(zr/n))? = 1 — (x?/r?). It then follows that the 
product converges to [];2, (1 — (x?/r?)). (The rigorous justification is 
straightforward, and will be left as an exercise below.) 
We conclude: 


2 x? sin(7rx) mx? txt — 8x8 p88 
[Tl -s)eee-1- Stas ata 
ent n 7x 3! 


using the Taylor series for sine. But 


sinh(wx) _ mx? txt — 8x8 788 

7X 7 3! 

If we multiply out the infinite product for sin(ax)/(7x), we get a minus sign 
precisely in those terms having an odd number of x?/n? terms, i.e., precisely 
for the terms in the Taylor series for sin(zx)/(7x) having a minus sign. Thus, 
changing the sign in the infinite product has the effect of changing all of the 
—’s to +’s on the right, and we have the desired product expansion of the 
proposition. (For a ‘‘better” way of thinking of this last step, see Exercise 3 
below.) O 


We are now ready to prove: 


Theorem 4. 


oe 2287? B, 
0k) = (=o apy (-F#)- 
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ProoF. First take the logarithm of both sides of 


sinh(«x) = nxT | (1 ee =) 
n=1 
(for x > 0). On the left we get 
log sinh(wx) = log{(e* — e~*)/2] = log{(e**/2)(1 — e~**)] 
= log(1 — e-9"**) + ax — log 2. 


On the right we get (for 0 < x < 1) 
loga + logx + 2 log(1 + x?/n?) = loga + logx + » Ps (- en 


by the Taylor series for log(1 + x). Since this double series is absolutely 
convergent for 0 < x < 1, we can interchange the order of summation and 
obtain the equality: 


= 2k 2 
log(| — e~?**) + ax — log2 = loga + logx + > (uae a 


bid 2k 
= loga + logx + > (-1)**} = L(2k). 
=1 


We now take the derivative of both sides with respect to x. On the right 
we may differentiate term-by-term, since the resulting series is uniformly 
convergent in 0 < x < 1 — « for any e > 0. Thus, 


Qare ~ 24* 
1 i e@~ 2nx 


mat $2 > (HMR x24), 
x K=1 


Multiplying through by x and then substituting x/2 for x gives: 


ax —1)**+42(2k 
wey + F-1+ 5 Oe ile si ) Sa 


The left-hand side gives: (7x)/2 + >= B,(ax)*/k!. Comparing coefficients 
of even powers of x gives: 7?*B.,/(2k)! = ((—1)**1/2?"-+)¢(2k), which gives 
us the theorem. O 


As examples, we have 


@=5 WF w=s 


The arrangement of the formula for {(2k) in the statement of Theorem 4 
was deliberate. We think of the (— B,/2k) as the “interesting” part, and the 
(— 1)*w?*2?*-1/(2k — 1)! as a nuisance factor. It is the interesting part that 
we end up interpolating p-adically. Some justification for taking (— B2,/2k) 
rather than the whole mess will be given later (§7). For now, let’s remark that 
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at least the 7?" factor has to be discarded when we interpolate the values 
p-adically, since transcendental real numbers cannot be considered p-adically 
in any reasonable way. (What could “p-adic ordinal” mean for them ?) 


2. p-adic interpolation of the 

function f(s) = a° 
This section will eventually play a role in the subsequent logical development. 
It is included at this time as a “‘dry run” in order to motivate certain features 
of the later p-adic interpolation which may otherwise seem somewhat 
idiosyncratic. 

As mentioned before, if a is a fixed positive real number, the function 
J(s) = ais defined as a continuous function of a real variable by first defining 
it on the set of rational numbers s, and then “interpolating” or “extending 
by continuity” to real numbers, each of which can be written as the limit of 
a sequence of rational numbers. 

Now suppose that a = n is a fixed positive integer. Consider m as an 
element in Q,. For every nonnegative integer s, the integer n° belongs to Z,. 
Now the nonnegative integers are dense in Z, in the same way as Q is dense 
in R. In other words, every p-adic integer is the limit of a sequence of non- 
negative integers (for example, the partial sums in its p-adic expansion). So 
we might try to extend f(s) = n° by continuity from nonnegative integers s to 
all p-adic integers s. 

To do this, we must ask if 7° and n® are close whenever the two non- 
negative integers s and s’ are close, for example, when s’ = s + p™ for some 
large N. A couple of examples show that this is not always the case: 

(1) n =p, s = 0: |n® — n*|, = |1 — p?”|, = 1 no matter what N is. 

(2) 1 <n < p: by Fermat’s Little Theorem (see §III.1, especially the first 
paragraph of the proof of Theorem 9), we have n = n? (mod p), and so 
n =n? =n =n” =.--- = n™ (mod p); hence n® — n't?" = n(1 — n?”) 
= n'(1 — n) (mod p); thus |n° — n*|, = 1 no matter what N is. 

But the situation is not as bad as these examples make it seem. Let’s 
choose n so that n = 1 (mod p), say n = 1 + mp. Let |s’ — s|, < 1/p¥, so 
that s’ = s + s"p% for some s” € Z. Then we have (say s’ > s) 

Jn? — nly = |n'ly| — n'y = [1 — nly = | — + my" 
But expanding 


HNC otpN __ 
(1+ pyr = 1 + (stp )mp + SPER —D py + Ompy 


shows that each term in 1 — (1 + mp)**?” has at least p”*+1. Thus, 


- 1 
|n? —n lp s |p’ *?|, a pre 


In other words, if s’ — sis divisible by p”, then n° — n® is divisible by p”*?. 
Thus, if m = 1 (mod p), it makes sense to define f(s) = n° for any p-adic 
integer s to be the p-adic integer which is the limit of n*: as s, runs through any 
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sequence of nonnegative integers which approach s (for example, the partial 
sums of the p-adic expansion of s). Then f(s) is a continuous function 
from Z, to Z,. 

We can do a little better—allowing any n not divisible by p—if we’re 
willing to insist that s and s’ be congruent modulo (p — 1), as well as modulo 
a high power of p. That is, we fix some sy € {0, 1, 2, 3, ..., p — 2}, and, 
instead of considering n° for all nonnegative integers s, we consider n° for all 
nonnegative integers s congruent to our fixed s) modulo (p — 1). Letting 
5S = 59 + (p — 1)s,, we are looking at nm%ot-s: for s, any nonnegative 
integer. We can do this because then 

n= n'o(n? —1)81, 


and for every n not divisible by p we have n?-! = 1 (mod p). Thus, we are in 
the situation of the last paragraph with n?~1 in place of n and s, in place of 
s (and a constant factor n*o thrown in). 

Another way of expressing this function is as follows. Let S,, be the set of 
nonnegative integers congruent to s) mod (p — 1). S,, is a dense subset of Z, 
(Exercise 7 below). The function f- S,,—> Z, defined by f(s) = n° can be 
extended by continuity to a function f: Z, > Z,. Notice that the function f 
depends on sp as well as on 2. But when p = 2 we have sy = 0, and so if n is 
odd, then n* is continuous as a function of all nonnegative integers. 

If n = 0 (mod p), we are out of luck. This is because n —> 0 p-adically for 
any increasing sequence of nonnegative integers. And if s € Z, is not itself a 
nonnegative integer, any sequence of nonnegative integers which approach s 
p-adically must include arbitrarily large integers. It follows that the zero 
function is the only possible candidate for n’, and that’s absurd. 

One final remark: the above discussion applies word-for-word to the 
function 1/n* (Exercise 8 below). 

Now let’s look at the Riemann zeta-function 


oo 


us)= 4 (>. 


nel 

The naive way to try to interpolate {(s) p-adically would be to interpolate 
each term individually and then add the result. This won’t work, because 
even the terms which can be interpolated—those for which p{n—form an 
infinite sum which diverges in Z,. However, let’s forget that for a moment 
and look at the terms one-by-one. 

The first thing we'll want to do is get rid of the terms 1/n* with n divisible 
by p. We do this as follows: 


— 1 1 
= Re + P &(s); 


1 — 1 
US) = TOP Pat 
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It is this last sum 


Me > wa (1- pe 


which we will be dealing with later. This process is known as “taking out 
the p-Euler factor.” The reason is that {(s) has a famous expansion (see 
Exercise 1 below) 


1 
s) = —__—.- 
= 1 Ta 
The factor 1/[1 — (1/q‘)] corresponding to the prime gq is called the ‘‘g-Euler 
factor.” Thus, multiplying ¢(s) by [1 — (1/p‘’)] amounts to removing the 
p-Euler factor: 


1 
*(5) = poaietaeeeery 
ae er 

The second thing we'll want to do when interpolating ((s) is fix 5) ¢€ 
{0, 1,2, ...,p — 2} and only let s vary over nonnegative integers s¢ S,, = 
{s | 5 = 59 (mod p — 1)}. 

It will turn out that the numbers (— B,,/2k) arrived at in §1, when multi- 
plied by (1 — p?*~*), can be interpolated for 2k € Sy, (259 € {0, 2, 4, ..., p — 3}). 
Note that we are not multiplying by [1 — (1/p*)], as you might expect, but 
rather by the Euler term with 2k replaced by 1 — 2k: 1 — (1/p)~**) = 
1 — p?*-1, The reason why this replacement 2k <> 1 — 2k is natural will be 
discussed in §7. (We’ll see that the “‘interesting factor’? — Bp,/2k in (2k) 
actually equals ((1 — 2k); {(x) and &(1 — x) are connected by a “functional 
equation.”’) 

More precisely, we will show that, if 2k, 2k’ € S2,, (where 2k € {2, 4, ..., 
Pp — 3}; there’s a slight complication when ky = 0), and if k = k’ (mod p’), 
then (see §6) 


(1 — p*~*)(— Ba,/2k) = (1 — p?*’~*)(— Bax-/2k’) (mod p**?). 
These congruences were first discovered by Kummer a century ago, but their 


interpretation in terms of p-adic interpolation of the Riemann ¢-function was 
only discovered in 1964 by Kubota and Leopoldt. 


EXERCISES 


1. Prove that 
1 
s)= ——— fors>1. 
(8) ate (1 -— q7*) 
2. Prove that 


kK (1 — sin?(¢x/n)/ sin?(ar/n)) an 


I] 


tel (1 — x?/r?) 1 asn=2k+1—>0. 
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Exercises 


Use the relationship e** = cos x + isin x for e to a complex power to show 
that sinh x = —isin ix. Give another argument for how the infinite product 
for sinh x follows from the infinite product for sin x. 


Prove that B, = 0 if k is an odd number greater than 1. 


Use the formula for ¢(2k), along with Stirling’s asymptotic formula n! ~ 
V2nn n"e-" (where ~ means that the ratio of the two sides +1 as n —> 00) to 
find an asymptotic estimate for the usual Archimedean absolute value of Box. 


Use the discussion of n° in §2 to compute the following through the p*-place: 
(i) 1146 in Os (ii) V1/10 in Q, (iii) (— ©)2*+4°7+ 3-77 +79 + in Qz. 


. Prove that for any fixed sy € {0, 1,..., p — 2}, the set of nonnegative integers 


congruent to so modulo (p — 1) is dense in Z,, i.e., any number in Z, can be 
approximated by such numbers. 


What happens to the discussion in §2 if we take n € Z, instead of taking n to 
be a positive integer? What happens if we replace the function f(s) = n° by 
f(s) = 1/n’? Note that this is the same as replacing ‘‘ nonnegative integer’’ by 
“‘nonpositive integer’’ when defining the dense subset of Z, from which we 
extend f. 


Let x be the function on the positive integers defined by: 


1, ifm = 1 (mod 4); 
x() =<-1, ifm = 3 (mod 4); 
0, if 2|n. 
Define L,(s) = 2r=1 (x(@/n‘) = 1 — (1/35) + (1/55) — (1/78) +--+. Prove 


that L,(s) converges absolutely if s > 1 and conditionally if s > 0. Find 


L,(). Find an Euler product for L,(s) and for L,*(s) a Dnz1,pin (x(n)/n*). (It 


turns out that there is a formula similar to Theorem 4 for L,(2k + 1) (i.e., for 
positive odd rather than even integers) with B, replaced by 


Pee : sen Ae te* ( -t 

nn =n! times the coefficient of ¢” in ea eo (=z) 
Note: Exercise 9 is a special case of the following situation. Let N be a 
positive integer. Let (Z/NZ)* be the multiplicative group of integers prime 
to N modulo N. Let x:(Z/NZ)* + C* be a group homomorphism from 
(Z/NZ)* to the multiplicative group of nonzero complex numbers. (It is easy 
to see that the image of x can only contain roots of 1 in C.) Suppose that x is 
“primitive,” which means that there is no M dividing N, 1 < M < N, such 
that the value of x on elements of (Z/NZ)* only depends on their value 
modulo M. Consider x as a function on all positive integers ” by letting 
x(n) equal y(n modulo N) if m is prime to N and y(n) = 0 if m and N have a 
common factor greater than 1. x is called a “character of conductor N.” 

Now define 


LS) g > ©. 
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It can then be shown that for x nontrivial a formula similar to Theorem 4 holds 
with B, replaced by 
* x(a)te* 


By. =, n! times the coefficient of 1” in SS 4 
a=1 a 


The formula gives L, of even integers if y(—1) = 1 and L, of odd integers if 
x(-1) = —1. See Iwasawa, Lectures on p-adic L-functions.) 
In addition, we have the formula 


ete 
1 or, ify(-1) =1; 
L,() = ps x . by X(a) log sin 2 7 if x(—1) 


N-1 


rs 2-4, if x(-1) = -1, 


where the bar over x denotes the complex conjugate character: ¥(a) & x(a), 
and where 


™ = 


def 


N-1 
> x(a)e2™n 
a=1 


(this is known as a “‘ Gauss sum”). (For a proof, see Borevich and Shafarevich, 
Number Theory, p. 332-336.) 


Use the formula for L,(1) in the above note to check the value for L,(1) in 
Exercise 9 and to prove that: 

Pe Wei de oT 1 1 1 1 
Oot a sa 8 i SRS Ok ED. 

wks 

~ 35/3 

1 1 1 1 1 1 1 1 1 1 1 
OE 35° 9 oa is 1s i 1s a os 

_ log(l + 2). 
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p-adic distributions 


The metric space Q, has a “‘basis of open sets’’ consisting of all sets of the 
form a+ p*Z, = {xe Q, | |x — al, < (1/p%)} for ae Q, and NeZ. This 
means that any open subset of Q, is a union of open subsets of this type. 
We shall sometimes abbreviate a + p’Z, as a + (p”), and in this chapter 
we shall call a set of this type an “‘interval’’ (in other contexts we often call 
such a set a “‘disc’’). Notice that all intervals are closed as well as open, 
since the complement of a + (p¥) is the union over all a’ <Q, such that 
a’ éa + (p”) of the open sets a’ + (p”). 


Recall that Z, is sequentially compact: every sequence of p-adic integers 


has a convergent subsequence (see Exercise 19 of §1.5). The same is easily 
seen to be true for any interval or finite union of intervals. In a metric space X, 
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the property of sequential compactness of a set S © X is equivalent to the 
following property, called ‘“‘compactness’’: every time S is contained in a 
union of sets, it is contained in a union of finitely many of those open sets 
(‘“‘every open covering has a finite subcovering’’). (See Simmons, Introduction 
to Topology and Modern Analysis, §24, for this equivalence; this book is also 
a good standard reference for other concepts from general topology.) It then 
follows (see Exercise | below) that an open subset of Q, is compact if and 
only if it is a finite union of intervals. It is this type of open set, which we call 
a “‘compact-open,” that repeatedly occurs in this section. 


Definition. Let X and Y be two topological spaces. A map f/f: X¥ > Y is 
called locally constant if every point x € X has a neighborhood U such 
that f(U) is a single element of Y. 


It is trivial to see that a locally constant function is continuous. 

The concept of a locally constant function is not very useful in classical 
situations, because there usually aren’t any, except for constants. This is the 
case whenever X is connected, for example R or C. 

But for us X will be a compact-open subset of Q, (usually Z, or Z,* = 
{x eZ, | |x|, = 1}). Then X has many nontrivial locally constant functions. 
In fact, f: X > Q, is locally constant precisely when / is a finite linear com- 
bination of characteristic functions of compact-open sets (see Exercise 4 
below). 

Locally constant functions play the same role for p-adic X that step- 
functions play when X = R in defining integrals by means of Riemann 
sums. 

Now let X be a compact-open subset of Q,, such as Z, or Z,*. 


Definition. A p-adic distribution » on X is a Q,-linear vector space homo- 
morphism from the Q,-vector space of locally constant functions on ¥ to 
Q,. If f: X — Q, is locally constant, instead of writing u(f) for the value 
of pu at f, we usually write J Sp. 


Equivalent definition (see Exercise 4 below). A p-adic distribution » on X is 
an additive map from the set of compact-opens in X to Q,; this means that 
if U < X is the disjoint union of compact-open sets U,, Uz, ..., U,, then 


HCY) = (U1) + w(U2) +--+ + w(U,). 


By ‘‘equivalent definition,” we mean that any » in the second sense 
“extends” uniquely to a wu in the first sense, and any wp in the first sense 
“restricts”’ to a « in the second sense. More precisely, if we have a distribution 
pin the sense of the first definition, we get a distribution (also denoted ,) in 
the sense of the second definition by letting 


w(U) = J (characteristic function of U)p, 
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for every compact-open U. If we have a distribution » in the sense of the 
second definition, we get a distribution in the sense of the first definition by 
first letting 


| (characteristic function of U)u = p(U), 


and then defining J Sv for locally constant f by writing fas a linear combina- 
tion of characteristic functions. 


Proposition. Every map p from the set of intervals contained in X to Q, for 
which 


-1 


u(a + (p%)) = > pla + bp® + (p¥*4) 


b=0 
whenever a + (p) < X, extends uniquely to a p-adic distribution on X. 


Proor. Every compact-open U < X can be written as a finite disjoint union 
of intervals: U = LU J, (see Exercise 1). We then define n(U) = > pi). 
(This is the only possible value of u(U) if » is to be additive.) To check that 
p(U) does not depend on the partitioning of U into intervals, we first note 
that any two partitions U = J J,and U = [J J of U into a disjoint union of 
intervals have a common subpartition (‘‘finer” than both) which is of the 
form J, = LU, 4;, where, if J, = a + (p%), then the J,’s run through all 
intervals a’ + (p’’) for some fixed N’ > N and for variable a’ which are 
=a (mod p”). Then, by repeated application of the equality in the statement 
of the proposition, we have: 


pN’-N-1 


wh) = wa + (p")) = D(a + jp’ + (p")) = 2 u(). 


J=0 


Hence >; #(/) = >i,; u(Z,;). Thus, >; ed) = >; v(Z’), because both sides equal 
the sum over the common subpartition. It is now clear that » is additive. 
Namely, if U is a disjoint union of U;, we write each U, as a disjoint union of 
intervals J,;, so that U = L,,; J;, and 


w(U) = > phy) = y2 p w(1,) = 2 (U,). o 


td 
We now give some simple examples of p-adic distributions. 
(1) The Haar distribution py... Define 


1 
Huaer(a + (p™)) def pY 


This extends to a distribution on Z, by the proposition, since 


p-l p-1 1 1 
Bi Puear(@ + bp’ + (p%*?)) = > N+1 — ON 
v=o v=0 P P 


Haar(@ + (p*)). 
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This is the unique distribution (up to a constant multiple) which is “transla- 
tion invariant,” meaning that for all a € Z, we have piyaar(a + U) = Myaar(V), 
where a + U = {xeZ,| x — ae U}. 

(2) The Dirac distribution », concentrated at « € Z, (« is fixed). Define 


1, ifeaeU; 
nO) za {5 


It is trivial to check that pu, is additive. Note that J Sta = f(a) for locally 
constant f. 

(3) The Mazur distribution yur. First, without loss of generality, when 
we write a + (p%) we may assume that a is a rational integer between 0 and 
p™ — 1. Assuming this, we define 


otherwise. 


a 1 
PMazur(@ + (p*)) det pv oe 


We postpone the verification that umazur has the additivity property in the 
proposition, since this will come as a special case of a more general result in 
the next section. 

Notice one important difference between the distributions pyaar and 
Mazur and classical measures. In these two p-adic examples, as the interval 
being measured “shrinks” (i.e., as N-—> oo), its measure in terms of pu 
increases as a number in Q,, namely: 


1 
|Huaar(a oF (P”))|p = tr = PY; 
IP \p 
and, if p { a (and if N > 1 in the case p = 2), then 
[nsec + (Plo = |% — 3]. = P®- 
UW: Pp ps 2 ei 


We'll deal with this peculiarity later. 


EXERCISES 


1. Give a direct proof that Z, is compact (i.e., that any open covering of Z, has a 
finite subcovering). Then prove that an open set in Z, is compact if and only if 
it can be written as a finite disjoint union of intervals. Note that any interval 
can be written as a disjoint union of p “equally long” subintervals: a + (p") = 

P-$a + bp™ + (p"*'). Prove that any partition of an interval into a disjoint 
union of subintervals can be obtained by applying this process a finite number 
of times. 


2. Give an example of a noncompact open subset of Z,. 


3. Let U be an open subset of a topological space X. Show that the characteristic 
function f:X — Z defined by 


fay = {F ifxe U; 


0, otherwise, 
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is locally constant if X = Z, and U is a compact-open but is not locally 
constant for any open set U if X = R (unless U is R itself or the empty set). 


4. Let X be acompact-open subset of @,. Show that f: X > Q, is locally constant 
if and only if it is a finite linear combination with coefficients in Q, of charac- 
teristic functions of compact-opens in X. Then prove that the two definitions 
of a distribution on X are equivalent. 


5. If ce Q,, |a|p = 1, show that pyear(®@U) = pyear(U) for all compact-open U, 
where aU denotes {ax|x € U}. 


6. Let f: Z, > Q, be the locally constant function defined by: f(x) = the first 
digit in the p-adic expansion of x. Find f fu when: (1) «4 = the Dirac distribu- 
tion a; (2) KB = Huaar; (3) = Pnazur- 


7. Let p be the function of intervals a + (p”) which is defined as follows ([ ] = 
greatest integer function): 


p "+21 if the first [N/2] digits in a corresponding to odd 
pa + (p™)) = powers of p vanish; 
0, otherwise. 


Prove that w extends to a distribution on Z,. 


8. Discuss how one could go about making up examples of p-adic distributions u 
on Z, with various growth rates (i.e., rates of growth of maxo<a<p¥|u(a + (P%))|p 
as N increases). 


4. Bernoulli distributions 


We first define the Bernoulli polynomials B,(x). Consider the function in 


two variables t and x 
— (xt)* 
-(> 2. By i) (> 2 k! oy). 


k=0 


In this product, we collect the terms with ¢*, obtaining for each k a poly- 
nomial in x, and we define B,(x) to be k! times this polynomial: 


= > B(x) 5 kr 


k=0 


The first few Bernoulli polynomials are: 


B(x) = 1, B(x) = x — 4, B(x) = x? —x + 4, 
B(x) = x° — 3x? + 4x,.... 


Throughout this section, when we write a + (p”) we will assume that 
0 <a < p™ — 1. Fix a nonnegative integer k. We define a map yz, on 
intervals a + (p”) by 


Bp,x(a + (p™)) = pre-op, S)- 
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Proposition. 4; extends to a distribution on Z, (called the “kth Bernoulli 
distribution”’). 


Proor. By the proposition in §3, we must show that 
p-1 
ea + (p™)) = > Hp.x(a + bp’ + (p¥*)). 
b=0 


The right-hand side equals 


ye 2 (a t+ bp 
prrpue-D > Baer). 


b=0 


so, multiplying by p-"“-» and setting « = a/p’**, we must show that 
p-1 b 
B,(pa) = p*~* > Be(« + *). 
b=0 Dp 


The right-hand side is, by the definition of B,(x), equal to k! times the coeffi- 
cient of t* in 


mm Fe - 1 e-—1 & e—1 eP—]’ 


-1 7 -1 = 
poy eee eS et ee 


by summing the geometric progression >?=4 e/?. This expression equals 


ke, t (pa)tip hea t J 
ROT oP 2, Ble) 
s=0 J: 


again by the definition of B,(x). Hence, k! times the coefficient of t* is simply 


PrB(pa)(2)" = (pa), 


as desired. O 


The first few B,(x) give us the following distributions: 
Hp,o(a + (p%)) = p-%, i.€., 4e,0 = Muaars 


a a 1, 4 
Bela + (p™)) = ,(5) aa py — > L€., 3,1 = !Mazur> 


a a_i 
pala + (P")) = o*( Se - 4 + 3); 


and so on. 

It can be shown that the Bernoulli polynomials are the only polynomials 
(up to a constant multiple) that can be used to define distributions in this way. 
We shall not need this fact, and so will not prove it. But it should be noticed 
that the Bernoulli polynomials B,(x) have appeared in an important and 
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unique role in p-adic integration. This will turn out to be related to the 
appearance of the Bernoulli numbers 8, (which are the constant terms in 
the B,(x); see Exercise 1 below) in the formula for (2k). 


5. Measures and integration 


Definition. A p-adic distribution » on X is a measure if its values on compact- 
open Uc X are bounded by some constant Be R, i.e., 


|4(U)|, < B for all compact-open Uc X. 


The Dirac distribution p, for fixed «€ Z, is a measure, but none of the 
Bernoulli distributions are measures. There is a standard method, called 
“regularization,” for turning Bernoulli distributions into measures. We first 
introduce some notation. If « € Z,, we let {a}, be the rational integer between 
0 and p® — 1 whichis = « (mod p”). If wis a distribution and « € Q,, we let 
a denote the distribution whose value on any compact-open is « times the 
value of uw: (az)(U) = a-(u(U)). Finally, if U < Q, is a compact-open set 
andaeQ,,a # 0, weletaU = {xeQ, | x/« € U}. It is trivial to check that the 
sum of two distributions (or measures) is a distribution (resp. measure), any 
scalar multiple op of a distribution (or measure) p is a distribution (resp. 
measure), and, if «¢ Z,* and if yu is a distribution (or measure) on Z,, then 
the function p’ defined by p’(U) = p(aU) is a distribution (resp. measure) 
on Z,. 

Now let a be any rational integer not equal to 1 and not divisible by p. 
Let 5,x,«—or, more briefly, y,,.—be the ‘‘regularized”’ Bernoulli distribution 
on Z, defined by 


PeaU) = bax(U) — aug .(aU). 


We will soon show that j,,, is a measure. In any case, it’s clearly a distribu- 
tion by the remarks in the last paragraph. 

We easily compute an explicit formula when k = 0 or 1. For k = 0, 
Hs,0 = Muaar, and it is easy to see that 4o,,(U) = 0 for all U (see Exercise 5 of 
§3). If k = 1, we have 


H1,0(a + (p%)) = - = ; . (Se . 2) 


=! «8-18 -[B) 


(where [ ] means the greatest integer function) 


“24 


Proposition, |;,.(U)|, < 1 for all compact-open U © Z,. 


ProorF. Notice that («-+ — 1)/2 € Z,, since 1/2 € Z, and 1/2 € Z, unless p = 2. 
If p = 2, then a~! — 1 = 0 (mod 2), and we're still OK. Since [aa/p”] € Z, 
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it follows by the above formula that », ,(a + (p”))€Z,. Then, since every 
compact-open U is a finite disjoint union of intervals J,, we may conclude that 
|H#1,0(U)|p = max \Hr,aCh)|p s 1. O 


Thus, 41,2 is a measure—the first interesting example of a p-adic measure 
that we’ve come across. In fact, we'll soon see that 4, ., plays a fundamental 
role in p-adic integration, almost as fundamental as the role played by “dx” 
in real integration. 

We next prove a key congruence that relates p,.¢ to 41,4. The proof of 
this congruence at first looks unpleasantly computational, but it becomes 
more transparent if we think of an analogous situation in real calculus. 
Suppose that in taking integrals such as J f(’x)dx we want to make the 
change of variables x +> x*, i.e., to evaluate if S(x*)d(x*). The simple rule is: 
d(x*)/dx = kx*-}, Actually, d(x) can be thought of as a ‘“‘measure”’ », on 
the real number line, which is defined by letting y,({a, 6]) = b* — a*; then 
#4, is the usual concept of length. The relation d(x*)/dx = kx*~1 actually 
means 

. Hx([a, 5]) _ Bpk-1 
meaae) 


Thus, in the Riemann sums > f(x,)u,(4,) in the limit as the J,’s all become 
smaller we may replace y,(1,;) by kx*~14,(/,) and get J S(x)kx*~? dx. 

The actual proof that lim,.., 4([a, b))/u:([a, b]) = ka*~} uses the binomial 
expansion for (a + h)* (where h = b — a)—actually, only the first two terms 
a*® + kha*-* really matter. Similarly, in the p-adic case, when we show that 
Peal) ~ ka*-*p, (1) if J is a small interval containing a, we also use the 
binomial expansion. Thus, Theorem 5 should be thought of as analogous to 
the theorem that (d/dx)(x") = kx*~1 from real calculus. (Forget about the d, 
on both sides of the congruence in Theorem 5; all it means is that, when we 
divide both sides by d,, we must replace p” by p% ~ °4n%, where ord, d,, is just 
a constant which doesn’t matter for large N.) 


Theorem 5. Let d,, be the least common denominator of the coefficients of B,{x). 
Thus: d, = 2, d, = 6, ds = 2, etc. Then 


dy My, g(a + (p%)) = dyka*~* yy ,(a + (p*)) (mod p*), 
where both sides of this congruence lie in Z,. 
Proor. By Exercise I below, the polynomial B,(x) starts out 
Box* + kB,x*-} ee tees 
Now 


dala + (p)) = aepr*-( (4) ~ «, (2). 


The polynomial d,,B,(x) has integral coefficients and degree k. Hence we 
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need only consider the leading two terms d,x* — d(k/2)x*-} of d,B,(x), 
since our x has denominator p’, so that the denominators in the lower terms 
of d,,B,(x) will be canceled by p’*~» with at least p% left over. We also note 
that 


aa = {axa}, (mod p”), 
and 


Lea = [= ({ ] = greatest integer function). 


Hence 


ke k 
dana + (p")) = dyph*-( Se — a-¥( hs) 


k({ ako} k-1 
“fee (2) ome 

a*® 5 = a a|\* 

= a(S = a ky (SF = Fal 


kf ,_ r » a a}\*-2 
— (a> — esoron( — [FN))) 


= Ce =. aHak-tat~"))) (mod p”) 


Ifea] . lla —1 
= k-1f of} =" 
se ({F| ag 


= d,ka*~ "py, ,9(a + (p”)). O 


Corollary. j,,. is ameasure for allk = 1,2,3,...andanyaeéZ,a¢ pZ,a # 1. 
Proor. We must show that u,..(@ + (p”)) is bounded. But by Theorem 5, 


A 


[Mx,al@ + (p™))|p < max ( 


pn 
A olka tps (a + (P")Is) 


oo lieala + (Pn): 


IA 


( 1 
max 


a, 
But |y:,.(a + (p*))|, < 1, and d,, is fixed. O 


What is the purpose of going to all this fuss to modify (‘‘regularize”’) 
Bernoulli distributions to get measures? The answer is that for an unbounded 
distribution p, J Jp is defined by definition as long as / is locally constant, 
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but you run into problems if you try to use limits of Riemann sums to extend 
integration to continuous functions /. 

For example, let 1 = pyaar, and take the simple function f: Z, + Z, given 
by f(x) = x. Let’s form the Riemann sums. Given a function f, for any N we 
divide up Z, into ()2%%' (a + (p")), we let xq, be an arbitrary point in the 
ath sniterval, and we define the Nth Riemann sum of / corresponding to 


{Xa,n} as 
pN-1 


Swtxawl) & o F(Xa,n)e(a + (p*)). 


In our example, this sum equals 


> e ha 
a py 
For example, if we simply choose x,y = a, we obtain 


<> - pox (Be a DO) pt. 
a=o 2 2 
This sum has a limit in Q, as N — oo, namely — 1/2. But if, instead of x..y = 


acea + (p*), we change one of the x.,y to a + agp’ € a + (p’) for each N, 
where ay is some fixed p-adic integer, we then obtain 


pN-1 N 
p* > a + agp") = 2 La as 


a=0 


whose limit is ag — 4. Thus, the Riemann sums do not have a limit which is 
independent of the choice of points in the intervals. 

A “measure” is not much good, and has no right to be called a measure, 
if you can’t integrate continuous functions with respect to it. (This is a slight 
exaggeration—see Exercises 8-10 below.) Now we show that bounded 
distributions earn their name of ‘‘measure”’. 

Recall that X is a compact-open subset of Q,, such as Z, or Z,*. (For 
simplicity, let X < Z,.) 


Theorem 6. Let u be a p-adic measure on X, and let f: X — Q, be a continuous 
function. Then the Riemann sums 


Swtzam G&D, f%an)ua + (P%) 
Osa<p 
at+(pNycXx 
(where the sum is taken over all a for which a + (p™) < X, and xXq,n is 
chosen in a + (p™)) converge to a limit in Q, as N-—> 00 which does not 
depend on the choice of {Xq,y}. 


Proor. Suppose that |u(U)|, < B for all compact-open Uc X. We first 
estimate for M > N 


| Sw. ten) _ Swtxaurlp- 
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By writing X as a finite union of intervals, we can choose N large enough so 
that every a + (p”) is either < X or disjoint from X. We rewrite Sy,ix.y) aS 
follows, using the additivity of p: 


aio Leandula + (p*9) 


a<p 
at+(pMycX 


(where @ denotes the least nonnegative residue of a mod p”). We further 
assume that N is large enough so that | f(x) — f(y)|,» < « whenever x = y 
(mod p”). (Note that continuity implies uniform continuity, since X is 
compact; this is an easy exercise, or else see Simmons’ book.) Then 


|Sw,tx0,0) Fo Su.txe.molp = 


> _, Se%a.n) — f&am)ela + (p)) 


Osa<p 
a+(pM)cXx 


max(|f(Xa,n) — f(%a,m)|p° |e(a + (p™))],) 


< «B, 


IA 


since Xa, = Xq,m (mod p*). Since « is arbitrary and B is fixed, the Riemann 
sums have a limit. 
It follows similarly that this limit is independent of {x,,,}. Namely, 


| Sw. ¢xe,n) — Sy.cx'e.wlp = 


> Fran) — f0%a.w)e(a + (p%)) 


Osa<pN 
at+(pN)ycX 


maxa(|f(%a,n) — f(Xa,n) |p: |u(a + (p"))|p) 
< &B. oO 


1A 


Definition. If f: ¥ — Q, is a continuous function and yu is a measure on X, we 
define f Ju to be the limit of the Riemann sums, the existence of which was 
just proved. (Note that, if fis locally constant, this definition agrees with 
the earlier meaning of J Sp.) 


The following simple but important facts follow immediately from this 
definition. 


Proposition. [ff: X — Q, is a continuous function such that | f(x)|, < A for all 
x € X, and if |u(U)|, < B for all compact-open U < X, then 


| f 


Corollary. If f, g: X — Q, are two continuous functions such that | f(x) — g(x)\p 
< for all x € X, and if |u(U)|, < B for all compact-open U < X, then 


| fm — fo 


<A-B. 
P 


< eB. 
Dp 


Exercises 


EXERCISES 


1. 


2. 


- 


2° 


10. 


Show that B(x) = dkeo (f)Bx*-', and, in particular, B,(0) = B,. Further 
show that 


[ ” B(x) dx = 
re) 


ny 


1 ifk =0, 


d 
0 otherwise, and that = = Bi(x) = kBr-1(x). 


Prove that no distribution (except for the identically zero distribution) has 
the property that 


max |u(a + (p"))|, > 0 asN>o. 
Osa<pN 


What is 5,.(Zp)? ba,x(pZp)? bs,x(Zp*)? 


. Prove that a p-adic distribution yp is a measure if and only if for some nonzero 


ae Z, the distribution a- y takes values in Z,. Prove that the set of measures on 
X is a Q,-vector space. 


Express fx,a(Zp) and px,.(Zp*) in terms of « and k. Find fz,* fy if f(x) = 
Diao ax'. 


Let p be an odd prime. For anya = 0, 1, ..., p* — 1, let Sa denote the sum of 
the p-adic digits in a. Show that x(a + (p")) = (—1)5* gives a measure on Z,, 
and that Sz, Su = f() for any continuous even function / (i.e., for which f(—~x) 


= f(x)). 


. Let p> 2, f(x) = I/x, and a= 1+ p. Prove that fz x fu;,,= —1 (mod p). If 


p =2 let « = 5, and prove that 7, fuy,, = 2 (mod 4). 


A distribution » on X is called “‘boundedly increasing” if maxo<a<p% 
|p\u(a + (p"))|p > 0 as N > 0, i-e., » “increases strictly slower than fuser.” 
Prove that Theorem 6 holds for yu if we assume that f: X > Q, satisfies the 
Lipschitz condition: there exists an A € R such that for all x, ye X 


f(x) — f(y) |p < Alx — yp. 


(This concept was introduced by Manin and applied by him to p-adic inter- 
polation of certain Hecke series.) 


Let » be the distribution defined in Exercise 7 of §3. Check that » is boundedly 
increasing. Let f: Z, — Z, be the function f(x) = x. Evaluate J Su, which we 
know is well defined by the previous problem. 


Let r be a positive real number. A function f: Z, — Q, is called (by Mazur) 
“of type r” if there exists A € R such that for all x, x’ €Z, we have 


f(x) — FV» S Alx — x’|,”. 


Note that any such function is continuous. If r > 1, then f is Lipschitz (see 
Exercise 8). Now let x be a p-adic distribution on Z, such that for some 
positive se R 
p Ns max |u(a +(p"))|, > 0 asN—-o. 
Osa<py 


Prove the analogue of Theorem 6 for such a p» and functions of type r when- 
everr>s. 
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6. The p-adic ¢-function as a Mellin-Mazur 
transform 
If X is a compact-open subset of Z,, any measure p» on Z, can be restricted to 


X. This means we define a measure -* on X by setting »*(U) = w(U) when- 
ever U is a compact-open in X. In terms of integrating functions, we have 


| it = J f- (characteristic function of X)y. 


We shall use the notation f xJ for this restricted integral f Spr. 
We said that what we want to interpolate is — B,/k. We have the simple 
relationship 


fo toon = Hoa(Zp) = By 


(see Exercise 3 of §5). Hence we want to interpolate the numbers 
—(1/k) Jz, I up, x: 

For different k are the distributions ys, related to each other in any 
straightforward way? Not quite, but the regularized measure p,q is related 
tO #1,¢ by Theorem 5. More precisely, we have the following corollary of 
Theorems 5 and 6: 


Proposition. Let f:Z,—>Z, be the function f(x) = x*—* (k a fixed positive 
integer). Let X be a compact-open subset of Z,. Then 


i Ip ,0 = al Sta 
x x 


Proor. By Theorem 5, we have 
Pica(@ + (p™)) = ka*~*py,o(a + (p%)) (mod p¥~ 4%). 


Now, assuming that N is large enough so that X is a union of intervals of the 
form a + (p”), we have 


[ Ine = > Hala + (p")) 


Osa<p 
a+(pNycXx 
= ka*yy,q(a + (p")) (mod p*-s4e) 
Osa<pN 
at+(pNycx 
=k > f@pc(a + (p%)). 
Osa<pN 
at+(pN)cXx 
Taking the limit as N > 00 gives fy lpn. =k Sx fili,0 Oo 


If we replace f by x*~1 in our notation, treating x as a “variable of inte- 
gration,” we may write this proposition as 


J Mena = kf x" 41 ae 
x x 
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The right-hand side looks much better than the left hand side from the 
standpoint of p-adic interpolation, since instead of k appearing mysteriously 
in the subscript of », it appears in the exponent. And we know from §2 what 
the story is for interpolating the integrand x*~? for any fixed x (see also 
Exercise 8 of §2). Namely, we’re in business as long as x # 0(mod p). To 
make all of our x’s in the domain of integration have this property, we must 
take X = Z,*. 

Thus, we claim that the expression i x*~1u, , can be interpolated. To 
do this, we combine the results of §2 with the corollary at the end of §5. That 
corollary tells us that if |f(x) — x*-4|, < efor xeZ,”, then 


J . ia — i ae 
Zp Zp 

(recall that |u,,.(U)|, < 1 for all compact-open U). Choose for f the function 
x*-1, where k’ = k (mod p — 1) and k’ = k (mod p”) (writing this as one 
congruence: k’ = k (mod (p — 1)p”)). By §2, we have: 


se 


[xk -2 — xF-1], << a for xeZ,*. 


(= ane nef xa 


We conclude that, for any fixed s, € {0, 1,2, ...,p — 2}, by letting & run 
through S,, = {positive integers congruent to so (mod p — 1)}, we can 
extend the function of k given by i x*-14, , to a continuous function of 
p-adic integers s: 


Thus, 
1 


pe pyar 


+s(p-1)-1 
ee 1a 
Zp 


But we have strayed a little from our original numbers —(1/k) f z, |HB,K- 
We just saw that we can interpolate 


I Pez Ipx,a- 
Zp 


Let’s relate these two numbers: 


1 1 : 
k I, I pk ,a = k He,a(Zp ) 
a i (1 — a@-*)(1 — p*-1)B, (see Exercise 5 of §5) 


1 
== 0 = p9(-7f) Maen): 
The term 1 — p*~+ made its appearance because we had to restrict our 


integration from Z, to Z,*. This is the phenomenon predicted at the end of 
§2: because we can not interpolate n* when p|n, we must remove a “p-Euler 
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factor” from the ¢-function before it can be interpolated. So we will inter- 
polate the numbers (1 — p*~1)(— B,/k): 


_ B 1 i 
(1 — p* »(- 3) Tg E a Tle x My 


One slight embarrassment, which we warned of in §2, is that the Euler term 
is 1 — p*-1 and not 1 — p~* as you might think it should be from the 
heuristic discussion in §2. It’s as though, instead of f(k), we were really 
interpolating “(1 — k)”’ (we haven’t yet defined what this means for 
positive k). So we define our p-adic ¢-function to have the value (1 — p*-1). 
(—B,/k) at the integer 1 — k, not at k itself. 


Definition. If k is a positive integer, let 
£1 — k) si — p*-*)(—-B,/k), 
so that, by the preceding paragraph, 


t(l- =o oe 

Note that the expression on the right does not depend on «, i.e., if B € Z, 
P{B,B # 1,then(s-* — es ae x lng = (a — I)" f, x1, 9, Since 
both equal (1 — p*-1)(—B,/k). This equality—this independence of a—can 
also be proved directly (see Exercise 1). We shall use this independence of « 
later, when we define ¢,(s) for p-adic s. 

But we first derive some classical number theoretic facts about Bernoulli 
numbers. These facts were considered to be elegant but mysterious oddities 
until their connection with the Kubota—Leopoldt ¢, and Mazur’s measure 
41,4 revealed them as natural outcomes of basic “‘calculus-type”’ considera- 
tions (namely, the corollary at the end of §5, which says, roughly speaking, 
that when two functions are close together on an interval, so are their 
integrals). 


Theorem 7. (Kummer for (1) and (2), Clausen and von Staudt for (3)). 
(1) Ifp — 1k, then |B,/k|, < 1. 
(2) Ifp — ljk andifk = k' (mod(p — 1)p%), then 
(1 pt-4) Ze = (1 — p¥-4) FE (mod p¥*?), 
(3) If p — 1|k (or if p = 2 and k is even or k = 1), then 
pB, = —1 (mod p). 


Proor. We assume p > 2, and leave the proof of (3) when p = 2 as an exer- 
cise (Exercise 6 below). 
We need a fact which will be proved at the beginning of the next chapter 
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(at the end of §III.1): There exists an a € {2, 3, ..., p — 1} such that «?—! is 
the lowest positive power of « which is congruent to 1 modulo p. Put another 
way, the multiplicative group of nonzero residue classes of Z mopulo p is 
cyclic of order p — 1, i.e., there’s a generator a € {2, 3, ..., p — 1} such that 
the least positive residues of «, «?, a, ..., a?~1 exhaust {1, 2, 3, ...,p — 1}. 

In the proof of parts (1) and (2), we choose our “‘measure regularizer”’ « 
to be such a generator in {2, 3, ..., p — 1}. This means that, since p — 1{(—k), 
we have «~* # | (mod p), so that (a~* — 1)-1e€Z,”*. 

To prove (1), we write (assuming k > 1; if k = 1 and p > 2, then 
|By/Ilp = |-1/2|p = 1): 


|Biklp = |Wa-* — Ill — p*~ Lol x el | 


Zp* 
k-1 
| e x F1,0 
Zp 


Pp 


by the proposition at the end of §5 (with A = B = 1), since |p,,.(U)|p < 1 
for all compact-open sets U < Z,* and [x*-1|, < 1 for all xeZ,*. 
To prove (2), we rewrite the desired congruence as 


cae Eieees Sere | “-1 N+1 
= ils es Pia Sa ae i Je," x 1a, (mod p¥ +1), 


a7* 

Notice that, if for a, b, c, dé Z, we have a = c (mod p") and b = d(mod p"), 
then we also have ab = cb = cd(mod p"). Thus, since a = (a-* — 1)7}, 
b= fee Me ¢ = (a — 107%, and d= f, . x* 4), are in Z,, it 
suffices to prove that («-* — 1)"! = (a-* — 1)-!(mod p¥t?) and 
i x*-1y = i x*-14, ,(mod p%*1). The first reduces to a* = a 
(mod p”*+), and the second reduces (using the corollary at the end of §5, 
with B = 1 and « = p~%~?) to showing that x*~? = x*-1 (mod p”*?) for 
all x € Z,*. But this all follows from the discussion in §2. 

Finally, we prove the Clausen-von Staudt congruence. For this let « = 
1 + p. Recall that we are proving it for p > 2. We have 


—kp k-1)-1 -1 
ak 7 — p*~*) i 5 cael See 


First take the first of the three terms on the right. If we let d = ord, k, then 
a-* — 1] = (1 + p)-* — 1 = —kp (mod p**?), 


PB, = —kp(—B,/k) = 


so that 


ak — | 


l= 


(mod p). 


Next, since k must be >2, we have (1 — p*-1)-! = 1 (mod p). Thus, 


pB, = [, . 2 th1,« (mod p). 
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Again using the corollary at the end of §5, this time with f(x) = x*-1 and 
g(x) = 1/x, we obtain 


pB, = I,  * 7 "1, (mod p). 
P 
But by Exercise 7 of §5, this latter integral is congruent to —1 (mod p). O 
We now return to p-adic interpolation. 


Definition. Fix s) ¢{0,1,2,...,p — 2}. For s€Z,(s #0 if so = 0), we 
define 


1 
— +(p-1)s-1 
Co.solS) der Gath D — ] omens Hiya: 
Pp 


It should by now be clear that this definition makes sense, namely, 
a Got P®-D9 = @-So(aP-*)-§ and x%ot®-Ys-1 for any xe Z,* 


are defined for p-adic s by taking any sequence {k,} of positive integers 
which approach s p-adically. Another way to define ¢,,,(s) is as follows: 
—limy,. — pet ?~ "Boos @—syk/(S0 + (P — 1K). 

We now see that if k is a positive integer congruent to s, (mod p — 1), i.e., 
k = 59 + (p — I)ko, then we have: ¢,(1 — k) = %,,.,(ko). We think of the 
Cp.a) aS p-adic “branches” of ¢,, one for each congruence class mod p — 1. 
(But note that the odd congruence classes—sy = 1, 3, ..., p — 2—give us 
the zero function, since for such so always By, + p-1)x, = 0; So we are only 
interested in even 5p.) 

In the definition of f, ,,, we excluded the case s = 0 when 5p = O. This is 
because in that case a~“ot‘?-s) = 1, and the denominator vanishes. If we 
write €,(1 — k) = ¢,,,,(ko), where k = 59 + (p — 1)ko, then this excluded 
case corresponds to @,(1). Thus, the p-adic zeta-function, like the Archimedean 
Riemann zeta-function, has a “pole” at 1. 


Theorem 8. For fixed p and fixed 59, £p,5,(5) is a continuous function of s which 
does not depend on the choice of «€Z, ptu, « # 1, which appears in its 
definition. 


Proor. It is clear that §2 and the corollary at the end of §5 imply that the 
integral is a continuous function of s. The factor 1/(«~“o*®-9 — 1) is a 
continuous function as long as we don’t allow s = 0 when sp = 0, because 
a~“o+(-D) is a continuous function by §2. So ¢,,,,(s) is also continuous. 

It remains to show that ¢,,,,(s) does not depend on a. Let Be Z, ptf, 
8 # 1. The two functions 


1 


+(p-1)s-1 
a7 ot@-bs) — | x*o Miva 


Zp* 
and 
eee eens xS0t (P-Ls~1 
B- Got P- D9 —1Jz°* 1,8 
P 


7 A brief survey (no proofs) 


agree whenever 59 + (p — 1)s = k is an integer greater than 0, i.e., whenever 
5 is a nonnegative integer (s > 0 if s9 = 0), since in that case both functions 
equal (1 — p*~+)(— B,/k). But the nonnegative integers are dense in Z,, so 
that any two continuous functions which agree there are equal. Therefore, 
taking B instead of « does not affect the function. oO 


Theorem 8 gives us our p-adic interpolation of the “interesting factor” 
— B2,/2k in ¢(2k). But a few things remain to be explained: (1) the terminology 
““Mazur-Mellin transform” in the title of this section; and (2) the mysterious 
switch from k to 1 — k. In addition, something should be said about (3) 
deeper analogues with classical ¢-functions and L-functions, and (4) a 
connection with modular forms. Since these four topics will take us beyond 
the scope of what we intend to prove in this book, they are gathered together 
in a section which surveys some basic relevant facts without attempting any 
proofs. References for proofs and further discussion of (1)-(4) are: (1) Manin, 
“Periods of cusp forms, and p-adic Hecke series,” §8; (2) Iwasawa, Lectures 
on p-adic L-functions, §1 and appendix; (3) Iwasawa, Lectures on p-adic 
L-functions, especially §5, and Borevich and Shafarevich, Number Theory, 
p. 332-336; (4) Serre, ‘“‘Formes modulaires et fonctions zéta p-adiques,” 
in Springer Lectures Notes in Mathematics 350. 


7. A brief survey (no proofs) 


(1) For s > 1, &(s) can be expressed as an integral 


Le pf gapn Qe 
res |, ae 
where I'(s) is the gamma-function, which satisfies [(s + 1) = sI'(s), [(1) = 1, 
so that, in particular, '(k) = (k — 1)! for positive integers k. (See Exercise 4 


below for the case s = k.) The integral is what is known as a Mellin transform. 
For a function f(x) defined on the positive reals, the function 


0s) =f x-f(x) de, 
0 
whenever it exists, is called the Mellin transform of f(x) (or of f(x) dx). 
Thus, I'(s)é(s) is the Mellin transform of dx/(e* — 1), which exists for s > 1 
(see Exercise 4 below). 


In §6, we showed that the function which p-adically interpolates (1 — p*-+) 
(—B,/k) is essentially (except for the 1/(@~* — 1) factor and the sy business) 


I % xs “Tha 
Zp 
where j;,, is the regularized Mazur measure. Thus, the p-adic {-function 
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can be thought of, in analogy to the classical case, as the ‘‘p-adic Mazur— 
Mellin transform” of the regularized Mazur measure 4; ,¢- 
(2) If we consider 


for s complex, with real part > 1, this sum still converges and defines a 
“complex analytic” function of s. By the technique of “analytic continua- 
tion,” ¢(s) can be extended onto the entire complex plane except for the point 
s = | (where its behavior is like the function 1/(s — 1)). A very basic pro- 
perty of {(s) is that it satisfies a “functional equation” which relates its 
value at s to its value at 1 — s. Namely, 


_ oy — 2 c0s(rs/2)I(s) 
cad S) — (27)5 {(s) 
Let’s let s = 2k be a positive even integer. Then 
_ 2cosmk (2k — 1)! 
cl — 2k) = One £(2k) 
_ 2(—1)K(2k — 1)! (= 1)?*2?*-3 (Boy 
= Gaye Qk = I Ik by Theorem 4 
syd ae, 
2k 


On the other hand, if s is an odd integer greater than 1, the right-hand side 
of the functional equation vanishes because cos(7s/2) = 0 (we need s > 1 in 
order for &(s) to be finite). Hence (1 — s) vanishes, and so there too 
{a — k) = —B,/k, but all this says is that 0 = 0. 


Table of (1 — k) = —B,/k 


= 1/12 

=3 1/120 

=5 — 1/252 

| 1/240 

-~9 21/132 
-11 691/32760 
213 —1/12 
15 3617/8160 
=17 —43867/14364 
-19 17461 1/6600 

—771683/276 
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So what we were “really” interpolating was the Riemann ¢-function at 
negative odd integers. We can now summarize the relationship between ¢, and 
¢ in the following simple way, using the definition of Z,: 


L(1 —k) =(1 — p®)1 — k) fork = 2,3,4,.... 


If we’re a little sloppy (forgetting that everything will diverge), we can 
write: (1 — k) = TTprimese 1/(1 — 9*~?), 


ed-k= [] Wd -¢')=-p' yd -h, 
primes qg,q7#p 
so the appearance of the (1 — p*~*) factor makes heuristic sense from this 
devil-may-care point of view. 
In the same tack, we can derive the formula ((1 — k) = —B,/k ina com- 
pletely straightforward way: 
eo 1 oO 


Ui -Ws 2 aes Do 


n=] =1 


Since (d/dt)*-*e"|,.9 = n*-1, we may write 


Qa 


SEW 


&|a 


—_ —_—, —_— a es. 


SES 


>|" 


(3) Connections between ¢, and ¢ go deeper. An important example 
requires us to consider the generalization of { to functions of the form 


— x(n 
LAs) > 2, s>0, 
n=1 
where x is a “‘character’’ (see Exercises 9-10 of §2). As long as x is not the 
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“trivial” character (equal to 1 for all m), this function L,(s) converges when 
5 = 1: D221 (x(#)/n), and can be computed explicitly. The result is: 


L,(0) = -"0'y) x(a) log(1 — e~ 20%), 


where N is the conductor of y and 7(y) = >X=} x(a)e2"/* (this formula 
easily reduces to those given in Exercises 9-10 of §2). 

In a manner very similar to the construction of £,, it is possible to inter- 
polate L,(1 — k) by “p-adic L-functions” L, ,. Amazingly, it turns out that 
L,,p(1) equals the following expression: 


~ (1 = 22) 2°" ga logy — e-, 


in which “log,” is the “p-adic logarithm,” which is a p-adic function of a 
p-adic variable (see §IV.1 and §IV.2), and all of the roots of unity that 
occur—namely, e?*“/" and the values of y—are considered as elements of 
an algebraic extension of Q, (see §III.2-3). Here (1 — (x(p)/p)) should be 
thought of as the p-Euler factor (for the ¢-function, y = 1, and the Euler 
factor in C(1), if ((1) were finite, would be (1 — (1/p)); see also Exercise 9 of 
§2 concerning Euler products for L,). The rest of the expression for L, p(1) 
is the same as L,(1), except that the classical log is replaced by its p-adic 
analogue log,. 

(4) Very important in the study of elliptic curves and of modular forms 
(see Chapter VII of Serre, A Course in Arithmetic) are the Eisenstein series 
E.,, k = 2, which are functions defined on all complex numbers z with 
positive imaginary part by: 


1 B. < 
E,,(z) = =) aE + > O24, -1(n)e?""™, 
nel 


where 


= m 
On(N) oe d™. 

The series should be thought of as a “‘ Fourier series ””—i.e., a series in powers 
of e?"7_with constant term equal to $Z(1 — 2k). 

It turns out that Eisenstein series can be p-adically interpolated. One hint 
of this is that we can interpolate each nth coefficient as long as p{n. Namely, 
that coefficient o2,_,(m) is a finite sum of functions d?*-1, all of which can 
be interpolated, by §2, since pjd. Then interpolating {(1 — 2k) can be thought 
of as “getting the constant coefficient too.”’ Vague as this all may seem, it is 
actually possible to derive the results in this chapter using the theory of p-adic 
modular forms. For details, see Serre’s paper mentioned before, and papers 
by N. Katz on p-adic Eisenstein measures and p-adic interpolation of Eisen- 
stein series (see Bibliography). 
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EXERCISES 


1. 


Using the relationship between j41,¢, }4x,a, and g,x, give a direct proof (without 
mentioning Bernoulli numbers) that 


1 


dat -1 
a7* -1 ed Pa.a 


does not depend on a. 


. Check the Kummer congruences from the table of (1 — k) when p = 5, 


= 2, k’ = 22, N = 1. Check in the table that the congruences fail when 
p — 1|k. Use the Kummer congruences and the first few values of B, to compute 
the following through the p?-place: 


(i) Broo in Qs; (ii) Boos in Q, (iii) Bso2 in Q,. 


. Use Theorem 7 and Exercise 20 of §I.2 to prove the following version of the 


theorem of Clausen and von Staudt: (B, + > 4)¢ Z, for any even k (or k = 1), where 
the summation is taken over all p for which p — 1|k. 


. Show that {> x*-?dx/(e* — 1) exists when s > 1. By writing 1/(e* — 1) = 


e-*/(1 — e-*) = Yr=1e7"*, prove that: 


ro xe 
| ae =k - 10K) fork = 2,3,4,... 
ar ae 


7 


(justify your computations). 


. Prove that 


[-<) xk-t 
=(k—-D!Id — 21-# = 
I SF de = h - Id = BN) fork = 2,34... 


Show that the function 
1 eae oot 
Tis) — 2!-*)Jo eX +1 


which you just showed equals {(s) for s = k = 2,3,4,..., exists and is 
continuous as a function of s for s > 0, 5s ¥# 1. 


dx, 


6. Prove the Clausen-von Staudt theorem when p = 2. 
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CHAPTER III 


Building up © 


1. Finite fields 


In what follows, we'll have to assume familiarity with a few basic notions 
concerning algebraic extensions of fields. It would take us too far afield to 
review all the proofs; for a complete and readable treatment, see Lang’s 
Algebra or Herstein’s Topics in Algebra. We shall need the following concepts 
and facts: 


(1) The abstract definition of a field F; a field extension K of F is any field 
K containing F; a field extension K is called algebraic if every «€ K 
is an algebraic element, i.e., satisfies; a polynomial equation with 
coefficients in F: dg + a,% + a,a? +---+a,a" = 0, where a; F. For 
example, the set of all numbers a + b ./2 with a, be Q is an alge- 
braic extension of Q. 

(2) If F is any field, its characteristic char(F) is defined as the least n such 
that when you add | to itself m times you get 0. If 1+ 1+---+ 1 
always # 0, we say char(F) = 0. (It might sound more sensible to say 
char(F) = ©, but the convention is to say that such fields have charac- 
teristic 0.) Q, Q,, R, and C are fields of characteristic 0, while the set 
of residue classes modulo a prime p is a field of characteristic p. (We'll 
see more examples of fields of characteristic p in a little while.) 

(3) The definition of a vector space V over a field F; what it means to have 
a basis for V over F; what it means for V to be finite-dimensional; if V 
is finite-dimensional, its dimension is the number of elements in a basis. 

(4) A field extension K of F is an F-vector space; if it is finite-dimensional, 
it must be an algebraic extension, and its dimension is called the degree 
[K:F]. If a eK has the property that every element of K can be written 
as a rational expression in «, we write K = F(«) and say that K is the 
extension obtained by “adjoining” « to F. If K’ is a finite extension of 
K, then it is easy to see that K’ is a finite extension of F, and [K’:F] = 
[K’: K] -[K:F]. 
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(5) 


(6 


~S 


(7 


~— 


(8) 


(9) 


1 Finite fields 


Any element a in a field extension K of F which is algebraic over F 
satisfies a unique monic irreducible polynomial over F (“‘monic” means 
it has leading coefficient 1, “irreducible”? means it cannot be factored 
into a product of polynomials of lower degree with coefficients in F): 


a” + a,_ya"" 1} +--+ aja + a) = 0, a,¢€ F; 


nis called the degree of a. The field extension F(a) has degree n over F 
(in fact, {1, a, «7, ...,a"~1} is one possible basis for F(a) as a vector 
space over F). 

If F is a field of characteristic 0 (for example, @ or Q,) or a finite field 
(we'll study finite fields in detail very soon), then it can be proved that 
any finite extension K of F is of the form K = F(a) for some ae K. 
a is called a “‘primitive element.” (Actually, this holds if F is any 
“perfect” field, where ‘“‘perfect”? means that either char (F) = 0, or 
else, if char (F) = p, every element in F has a pth root in F.) Knowing a 
primitive element a of a field extension K makes it easier to study K, 
since it means that everything in K is a polynomial in « of degree <n, 
ie., K = {S020 aa | a, € F}. 

Given an irreducible polynomial f of degree n with coefficients in F, we 
can construct a field extension K > F of degree n in which f has a root 
a«€K. Roots of all possible polynomials with coefficients in F can be 
successively adjoined in this way to obtain an “algebraic closure” 
(written F*€° or F) of the field F; by definition, this means a smallest 
possible algebraically closed field containing F (recall: a field K is called 
algebraically closed if every polynomial with coefficients in K has a 
root in K). Any algebraic extension of F is contained in an algebraic 
closure of F (i.e., can be extended to an algebraic closure of F). Any 
two algebraic closures of F are isomorphic, so we usually say “‘the 
algebraic closure,” meaning ‘“‘any algebraic closure.” The algebraic 
closure of a field F is usually the union of an infinite number of finite 
algebraic extensions of F; for example, the algebraic closure of Q@ 
consists of all complex numbers which satisfy a polynomial equation 
with rational coefficients. However, the algebraic closure of the real 
numbers R is C = R(V — 1), which is a finite algebraic extension of R of 
degree 2; but this is the exception rather than the rule. 

If K = F(a), if K’ is another extension field of F, and if o: K—> K’ 
gives an isomorphism of K with a subfield of K’ (where o is an “ F- 
homomorphism,” i.e., it preserves the field operations, and o(a) = a 
for all ae F), then the image o(«) of a in K’ satisfies the same monic 
irreducible polynomial over F as a does. Conversely, if K = F(a), if 
K’ is another extension field of F, and if «’ € K’ satisfies the same monic 
irreducible polynomial over F as a does, then there exists a unique 
isomorphism o of K with the subfield F(«’) of K’ such that o(a) = a for 
all ae F and such that o(«) = a’. 

In F = F*€:l, all the roots of the monic irreducible equation over F 
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satisfied by an element o € F are called the conjugates of «. There is a 
one-to-one correspondence between isomorphisms of F(«) with a sub- 
field of F and conjugates «’ of a (see the preceding paragraph (8)). If 
char (F) = 0 or if F is finite (or if F is perfect), then an irreducible 
polynomial can not have multiple roots, i.e., all the conjugates of « 
are distinct. In that case: (number of conjugates) = [F(«):F]. 

(10) If K = F(a), then K is called “‘Galois”’ if all of the conjugates of « are 
in K. In that case all conjugates of any x = >?2¢ aja‘ € K are in K, since 
such a conjugate is of the form >72¢ aa", where «’ is a conjugate of «. 
Examples of Galois extensions of Q are: Q(V2) (since « = V2 has 
one conjugate «e’ = —/2, which is the other root of x? — 2 = 0; here 
—V2€ Q(V2)); Q(i); Q(V a) for any de Q; Q(L,,), where £,, = e27/m 
is a primitive mth root of 1 in C (since the conjugates of £,, are other primi- 
tive mth roots, and these are of the form @,,‘ for i having no common 
factor with m). An example of a non-Galois extension of Q is Q(W2), 
since the conjugates of “2 are the 4 roots of xt — 2 =0, namely 
+02 and +i¥/2, and we have iW2 ¢ Q(W2) (since Q(W2) is contained 
in the real numbers). 

(11) If K is a Galois extension of F, then the isomorphisms in paragraph (8) 
all have image KX itself, i.e., they are F-isomorphisms from K to K, or 
‘“‘F-automorphisms of K.” The set of these automorphisms is a group, 
called the “‘Galois group of K over F.”’ If o is such an automorphism, 
then the set of x e K such that o(x) = x is called the “fixed field of o”’ 
(it’s easy to see that it’s a subfield of K containing F). For example, 
if K = Q(V2 + V3), which is a Galois field extension of Q of degree 4, 
and if o takes V2 + V3 to V2 — V3, then the fixed field of o turns 
out to be Q(vV/2). It is not hard to prove that, if K is a Galois extension 
of Fand K’ # Kis a field between K and F: F< K’ C K, then there is 
a nontrivial automorphism of K which leaves K’ fixed. In turns out there 
is a one-to-one correspondence between subgroups S of the Galois group 
of K over F and such intermediate fields F < K’ < K, where 

S< Ks = {xe K| ox = x forallae S}. 
But we shall only need simple cases of the facts in this paragraph, not 
the full power of Galois theory. 


We now proceed to the study of finite fields. The simplest example of a 
finite field is the ‘integers modulo a prime p.” This means: take the set of 
equivalence classes of integers for the equivalence relation: x ~ y means 
x = y(mod p). There are p such equivalence classes: the class of 0, 1, 2, 3, 
...>P — 2,p — 1. It is easy to define addition and multiplication and check 
that this set, which we call F,, forms a field (in particular, every non-zero 
equivalence class has an inverse; this amounts to saying that if p does not 
divide x, then there exists a y such that xy = 1 (mod p)). F, is sometimes 
written Z/pZ (meaning “the integers divided out by p times the integers’). 

We could have equally well started out with the p-adic integers Z,, and 
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defined x ~ y (x, ye Z,) to mean x = y (mod p) (i.e., x and y have the same 
first digit in their p-adic expansions). That is, F, can also be written Z,/pZ, 
(‘‘the p-adic integers divided out by p times the p-adic integers”’). Z,/pZ, is 
called the “‘residue field” of Z,. The reason why we have to study general 
finite fields before going further is as follows: the residue fields we get when, 
instead of Q, and Z,, we deal with algebraic extensions of Q, are not quite as 
simple as F,. They turn out to be algebraic extensions of F,. So we need to get 
a picture of what general finite fields look like. 

Let F be a finite field. Since not all the numbers 0, 1,1 + 1,1 +14 1,... 
can be distinct, F must have characteristic 40. Let nm = char(F). Note that 
n must be a prime, since if we could write m = non,, with mp and n, both <n, 
we would have 1) ¥ 0, so multiplying by n)~! would give: n, = no~1n = 0, 
a contradiction. So let p denote the prime number char(F). 

Clearly, any field F of characteristic p contains the field of p elements as a 
subfield (namely, by taking the subfield of F formed by all numbers of the 
form ] + --- + 1). This subfield is called the “prime field” of F. 

Note that in any field F of characteristic p, the map x+> x” preserves 
addition and multiplication: 


xy b> (xy)? = x?y?; 


Pp 
xt yr(xt yp = D> (P)xy- = x? + y?, 
{=0 
because for 1 < i < p — | the integer (?) = p!/(i! (p — i)!) is divisible by p, 
and hence equal to 0 in F. 


Theorem 9. Let F be a finite field containing q elements, and let f = [F:F,] 
(i.e., the dimension of F as a vector space over its prime field F,). Let K be an 
algebraic closure of F, containing F. Then q = p’; F is the only field of q 
elements contained in K; and F is the set of all elements of K satisfying the 
equation x* — x = 0. Conversely, for any power q = p’ of p, the roots of the 
equation x* — x = 0 in K are a field of q elements. 


Proor. Since F is an f-dimensional vector space over F,, the number of 
elements is equal to the number of choices of the f components (i.e., “‘co- 
ordinates” in terms of a basis of f elements) from F,, which is p’. Next, any 
field F of g elements has q — 1 nonzero elements, so that the nonzero elements 
of F under multiplication form a group of order g — 1. In this group the 
powers of an element x form a subgroup of order equal to the least power of 
x which equals one (called the ‘‘order”’ of x). But it is easy to prove that any 
subgroup of a finite group has order dividing the order of the group. Thus, x 
has order dividing g — 1, and so x*~-+ = 1 for all nonzero x in F. Then 
xt — x = 0 for all x (including 0) in F. Since this holds for any field of g 
elements in K, and a polynomial of degree q has at most g distinct roots in a 
field, it follows that any field of gq elements in K must be the roots of x* ~— x, 
and there is only one such set of q roots. 
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Conversely, given any g = p’, the set of elements of K such that x* = x is 
closed under addition and multiplication (the argument is the same as in the 
paragraph right before the statement of this theorem), and so is a subfield of 
K. This polynomial has distinct roots, because, if it had a double root, by 
Exercise 10 below, that root would also be a root of the formal derivative 
polynomial gx*-1 — 1 = —1 (because g = 0 in K); but the polynomial —1 
has no roots. oO 


Remark. Because any two algebraic closures of F, are isomorphic, it 
follows that any two fields of g = p’ elements are isomorphic. 


We let F, denote the unique (up to isomorphism) field of g = p’ elements. 
If F is a field, F* denotes the multiplicative group of non-zero elements 
of F. 


Proposition. F,* is a cyclic group of order q — 1. 


Proor. If we let o(x) denote the order of x (the least power of x which 
equals 1), we know that o(x) is a divisor of g — 1 for all xe F,*. But if d is 
any divisor of gq — 1, the equation x* = | has at most d solutions, because the 
degree d polynomial x? — 1 has at most d roots in a field. If d = o(x), then 
all d distinct elements x, x”, ..., x?~1, x4 = 1 satisfy this equation, and so 
they must be the only ones that do. How many of these d elements have 
order exactly d? It is easy to see that the answer is: the number of integers 
in {1, 2, ...,d— 1, d} which are relatively prime to d (have no common 
divisor with d other than 1). This number is denoted ¢(d). Thus, at most ¢(d@) 
elements of F,* have order d. We claim that exactly g(d) have order d for all 
divisors d of g — 1, and in particular for d = q — 1. This follows from the 
following lemma. 


Lemma. >4,.9(d) = n. 


PROOF OF THE LEMMA. Let Z/nZ denote the additive group {0, 1, ...,m — l} of 
integers modulo n. Z/nZ contains a subgroup S, for each divisor d of n defined 
as follows: S, is the set of all multiples of n/d. Clearly, every subgroup of 
Z/nZ is obtained in this way. 

S, has d elements, of which ¢(d) generate the full subgroup (i.e., the set of 
all multiples of mn/d exhausts the set of all multiples of n/din Z/nZ if and only 
if m and d are relatively prime). But each integer 0,1, ..., — 1 generates 
one of the subgroups S,. Hence 


{0, 1, ...,2 — 1} = (LJ {elements generating S,}. 
din 
Since this is a disjoint union, we have: n = >4a,9(d), and the lemma is proved. 
The proposition follows immediately, because if there were fewer than 


¢(d) elements of order d for some d|n ,we would have: n = 34), {elements of 
order d} < Yang(d) = n. Hence, in particular, there are y(g — 1) elements 
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of order g — 1. Since g(g — 1) = 1 (for example, 1 has no common factor 
with q — 1), there exists an element a of order exactly g — 1. Then F,* = 
{a, a’, ..., a7~}. O 


EXERCISES 


1. Let F be a field of g = p’ elements. Show that F contains one (and only one) 
field of q’ = p” elements if and only if f’ divides f. 


2. For p = 2,3, 5,7, 11, and 13, find an element ae {1,2,...,p — 1} which 
generates F,*, i.e., such that F,* = {a, a”, ..., a?~*}. In each case, determine 
how many choices there are for such an element a. 


3. Let F be the set of numbers of the form a + 5j, where a, be F3 = {0, 1, 2}, 
addition is defined component-wise, and multiplication is defined by 
(a + bj)(c + dj) = (ac + 2bd) + (ad + bc)j. Show that F = Fy; show that 
1 + jis a generator of Fy * ; and find all possible choices of a generator of Fy”. 


4. Write F, and F, explicitly in the same way as was done for Fy in the previous 
problem. Explain why any element except 1 in F,% or F,” is a generator. 


5. Let gq = p’, and let a be an element generating F,*. Let P(X) be the monic 
irreducible polynomial which a satisfies over F,. Prove that deg P = f. 

6. Let q = p’. Prove that there are precisely f automorphisms of F, over F,, namely the 
automorphisms o;, i = 0,1, ...,f — 1, given by: o,(x) = x?’ for x € Fy. 


7. Leta eF,*, and let P(Y) = X? — X — a. Show that, if a is a root of P(X), 
then so isa + 1, a + 2, etc. Show that the field obtained by adjoining a to F, 
has degree p over Fy, i.e., it is isomorphic to Fy». 


8. Prove that F, contains a square root of —1 if and only if gq # 3 (mod 4). 


9. Let ¢ be algebraic of degree 1 over Q,, i.e., £ satisfies a polynomial equation of 
degree 7 with coefficients in Q,, but none of degree less than n. Prove that 
there exists an integer N such that € does not satisfy any congruence 

Gn —1€"~ 2 + an-2€"~? + +--+ + ay€ + ao = 0 (mod p*), 
in which the a, are rational integers not all of which are divisible by p. 

10. If F is any field, and f(X) = X" + an-yX""1 +--+ + a,X + ao has co- 

efficients in F and factors in F, i.e., f(X) = []f=1 (XY — o) with a, € F, show 


that any root a, which occurs more than once is also a root of nX"~1 + 
An —-1(n — WX"? + an-o(n—2)X"-$ +--+ + a, 


2. Extension of norms 


If X is a metric space, we say X is compact if every sequence has a convergent 
subsequence (see beginning of §II.3). For example, Z, is a compact metric 
space (see Exercise 20 of §1.5). We say that X is locally compact if every point 
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xeéX has a neighborhood (i.e., a subset of X containing some disc 
{y | d(x, y) < e}) which is compact. The real numbers R are not compact, 
but are a locally compact metric space with the usual Archimedian absolute 
value metric. @, with the p-adic metric is another example of a locally 
compact metric space, for the simple reason that for any x the neighborhood 
x + Z, = {y||y — xlp < 1} is compact (in fact, it is isomorphic to Z, as a 
metric space). More generally, if X is an additive group such that d(x, y) = 
d(x — y, 0) for all x, y (for example, if X is a vector space and the metric is 
induced from a norm on X, as defined below), then X is locally compact 
whenever 0 has a compact neighborhood U. Namely, for any x, the transla- 
tion of U by x:x + U= {y| y — xe U}, is a compact neighborhood of x. 
In Q,, U = Z, is such a compact neighborhood of 0. It is not hard to see 
(Exercise 6 below) that any such locally compact group is complete. 

Let F be a field with a non-Archimedean norm || ||. For the duration of 
this section we assume that F is locally compact. 

Let V be a finite dimensional vector space over F. By a norm on V we 
mean the analogous thing to a norm on a field, namely, a map || ||y from V 
to the nonnegative real numbers satisfying: (1) ||x||y = 0 if and only if 
x = 0; (2) |lax||y = lal] ||xlly for all x e Vand ae F (here jja|| is the norm in 
F); and (3) ||x + ylly < ||xlly + ||» lv. For example, if K is a finite extension 
field of F, then any norm on K as a field whose restriction to F is || || is also 
a norm on K as a vector space. However, a word of caution: the converse is 
false, since Property (2) for a vector space norm is weaker than the corres- 
ponding property for a field norm (see Exercises 3-4 below). 

As in the case of fields, we say that two norms || ||, and || |. on V are 
equivalent if a sequence of vectors is Cauchy with respect to || ||, if and only 
if it is Cauchy with respect to || ||. This is true if and only if there exist posi- 
tive constants c, and c, such that for all xe V: |x]. < c, |x|], and |x|, < 
C2||x|] 2 (see Exercise 1 below). 


Theorem 10. If V is a finite dimensional vector space over a locally compact 
field F, then all norms on V are equivalent. 


ProoF. Let {v,, ..., ¥,} be a basis for V. Define the sup-norm || ||su, (pro- 
nounced “‘soup norm”’) on V by 


lai, Sea @,Vnllsup det coe (la ll). 


Note that this definition depends on the choice of basis. This || || up is a norm 
(see Exercise 2 below). Now let || ||), be any other norm on VJ. First of all, for 
any x = a,v, +--+: + a,v, we have 


IlxIv < lar lerlly +--+ + lanl leally 
< n(max ||a,||) max ||o\||v, 


so that we get || ly < ci || llaup if we choose c, = m max; <icn(|%llv). It 
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remains to find a constant c. such that the reverse inequality holds; then it 
will follow that any norm on V is equivalent to the sup-norm. 
Let 
U= {xe V | ||x|| sup = 1}. 


We claim that there is some positive e such that ||x||y > e for xe U. If this 
weren’t the case, we would have a sequence {x,} in U such that ||x;||y > 0. 
By the compactness of U with respect to |] ||sup (Exercises 2, 8 below), there 
exists a subsequence {x;,,} which converges in the sup-norm to some xe U. 
But for every j 


I|xXllv < |x — xylv + [lxylly < ex [lx — Xyllsup + [,llv, 


by our first inequality relating the two norms. Both of these terms approach 
0 as j — 00, since x,, converges to xin || ||sup, and ||x;||y > 0. Hence ||x||, = 0, 
so that x = 0¢ U, a contradiction. 

Using this claim, we can easily prove the second inequality, and hence the 
theorem. The idea is: the claim says that on the sup-norm unit sphere U 
the other norm || ||, remains greater than or equal to some positive number e, 
and hence || ||sup < ¢2 || lly on U, where cy = 1/e (on U the left side of this 
inequality is 1, by definition); but everything in V can be obtained by multi- 
plying U by scalars (elements in F), so the same inequality holds on all of V. 

More precisely, let x = a,v, +:--+,v, be any nonzero element in V, and 
choose j so that ||a,|| = max||a,|| = ||x{|sup. Then clearly (x/a,) ¢ U, and so 


\|x/a;lly = © = 1/ca, 
so that 
||| sup = |la,l| < co ||x\lv. Oo 


Corollary. Let V = K be a field. Then there is at most one norm || ||x of Kasa 
field which extends || || on F (i.e., such that |\a||x = |lal| for ae F). 


PROOF OF COROLLARY. Note that the field norm || _||x is also an F-vector space 
norm, because it extends || ||. By Theorem 10, any two such norms || ||, and 
|| ||, on K must be equivalent. Hence || ||, < c,|| ||,. Let x € K be such that 
ll, # Ixll2, say, |x|], < |x|]. But then for a sufficiently large N we have 
cy llx* Il, < ||x*|l2, a contradiction. O 


This still leaves the question of whether there exists any norm on K 
extending || || on F. 

We now recall a basic concept in field extensions, that of the “norm” of 
an element. This use of the word “norm” should not be confused with the 
use so far in the sense of metrics. ‘‘Norm” in the new sense will always 
be in quotation marks and denoted by N. 

Let K = F(a) be a finite extension of a field F generated by an element a 
which satisfies a monic irreducible equation 


O = x* + ax" 1 +--+ + ay_ix + ay, a,¢€ F. 
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Then the following three definitions of the “‘norm of « from K to F,” abbre- 
viated Nx)p(«), are equivalent: 


(1) If K is considered as an n-dimensional vector space over F, then multipli- 
cation by a is an F-linear map from K to K having some matrix A,. We 
let Nxir(a) = det(A,). 

(2) Nair(e) & (—1)"an. 

(3) Nxir(e) = []f=1%, where the «, are the conjugates of a = a, over F. 


The equivalence (2) <> (3) comes from: x" + a,x"-}+.---+ a, =T]fe1 
(x — a). The equivalence (1) = (2) is easy to see if we use {1, «, a, ..., a"-1} 
as a basis for K over F. Namely, the matrix of multiplication by « is then 


clearly (using: «” = —aja""! —--- — a,_y« — a,): 
0 0 —a, 
1 0 0 —An-j 
1 0 
0 —A2 
1 —a, 


which has determinant (—1)"a,, as follows immediately by expanding using 
the first row. 

If Be K = F(a), we can define Nx,-(8) as either (1) the determinant of the 
matrix of multiplication by 8 in K, or, equivalently, (2) (Neg), r(8))* FO, The 
two are equivalent because, if we choose bases for F(f) as a vector space 
over F and for K as a vector space over F(f), then as a basis for K over F we 
can take all products of an element in the first basis with an element in the 
second basis; using this basis for K over F, we see that the matrix of multipli- 
cation by f in K takes the following “‘ block form” 


A, 0 
0 A, 


As 


where A, is the matrix of multiplication by 8 in F(8). The determinant of this 
matrix is the [K:F(8)]-th power ([K:F(§)] is the number of blocks) of det Az, 
ie., the [K:F(8)]-th power of Nagy (8). Thus, the two definitions of Nx;(8) 
are in fact equivalent. 

Since Nx)-(a) is defined for any « € K as the determinant of the matrix 
of multiplication by a in K, it follows that Nx,p is a multiplicative map from 
K to F, ie., Nxjr(oB) = Nx p(a)Nx p(B). (Namely, multiplication by of is 
given by the product of the matrix for « and the matrix for B, and the determi- 
nant of a product of matrices is the product of the determinants.) 

We can now figure out how the extension of | |, to an algebraic number 
« € Q2/8°! must be defined if it exists. Suppose « has degree n, i.e., its monic 
irreducible polynomial over Q, has degree n. Let K be a finite Galois extension 
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of Q, containing a (see paragraph (10) at the beginning of this chapter), for 
example, K can be the field obtained by adjoining « and all of its conjugates 
to Q, (it’s easy to check that this field is finite and Galois over Q,). Suppose 
we find an extension || || of | |, to K. By the corollary to Theorem 10, || | 
is the unique field norm on K extending | |,. Now let «’ be any conjugate of a, 
and let o be a Q,-automorphism of K taking a to a’ (see paragraphs (8), (9), 
and (11) in §III.1). Clearly the map || ||: K > R defined by ||x||' = |lo(x)|| is 
a field norm on K which extends | |,. Hence || ||’ = || ||, and so ||| = lol’ = 
\|o(a)|| = {la’||. We conclude that the norm of « equals the norm of each of its 
conjugates. But then the norm of Ng,aa,(«), Which is in Q,, equals 


| No,wia,()|» = I Na, aa) I 
=| [] #' 
conjugates a’ of « 
=[ Tle’ 
= ||". 
Thus, 
Kel = [Na a,(@)|3”- 


So, concretely speaking, to find the p-adic norm of a, look at the monic 
irreducible polynomial satisfied by a over Q,. If it has degree n and constant 
term a,, then the p-adic norm of a is the nth root of |a,|,. (Of course, we have 
not yet proved that this rule really has all of the required properties of a 
norm; this will be Theorem 11 below.) 

Note that we can equivalently define ||«|| to be 


I Nxag(a)|2"* "Or, 


where K is now any field of finite degree over Q, that contains a, 


Nx 1%) = (Na,@ia,())™ 7Q,(@)1 


and 
= ‘ _ [K:Q,] 
n= [Q,(a):Q,] th [K: Q,(«)] 
We now prove that this rule || || really is a norm. We shall write | |, 
instead of || || to denote the extension of | |, tu K; this should not cause 


confusion. The reader should be warned that Theorem 11 is not an easy 
fact to prove. The proof given here, which was told to me by D. Kazhdan, 
is much more efficient than other proofs I’ve seen. But it should be read and 
re-read carefully until the reader is thoroughly convinced by the argument. 


Theorem 11. Ler K be a finite extension of Q,. Then there exists a field norm 
on K which extends the norm | |, on Q,. 


Proor. Let n = [K:Q,]. We first define | |, on K, and then prove that it’s 
really a field norm on K extending | |, on Q,. For any a € K we define 
lelp = INxa,(2)l5”, 
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where the right-hand side is the old norm in Q,. (Here 7 is the degree of the 
field K over Q,, and is not necessarily the degree of the element « over Q,.) 
It is easy to check that: (1) ||, agrees with the old ||, whenever a € Q,; (2) 
|a|, is multiplicative ; and (3) |a|, = 0<>a = 0. The hard part is the property: 
la + Bl, < max(|al,, |Bl,). 

Suppose that ||, is the larger of |«|,, |B|,. Setting y = a/B, we have 
lylp <1. We want to show that |« + B|, < max(|a|,,|B|,) = |Blp, or 
equivalently (after dividing through by ||,): |1 + y|, < 1. Thus, Theorem 
11 follows from the following lemma. 


Lemma. With | |, defined as above on K, one has |1 + y|, < 1 for any ye K 
with |y|, < 1 


PROOF OF THE LEMMA. We noted before that we can define |y|, and |1 + y|, 
using the field Q,(y) = Q,(1 + ») in place of K: 


lYlp= Na, ea,(7) |p or Ml |i + Yb = |Ne, oye, + y) |p tor: "Onl 


So without loss of generality we may suppose that K = Q,(y), in other words, 
that y is a “primitive element” of K. Then {1, y, y?,..., 7" 1} is a vector space 
basis for K over Q,, where n = [K:Q PB 

For any element « = >"2d a,yie K, let «|| denote the sup-norm in this 
basis, i.€., ||a|| =, max; |a;|,. Similarly, if A = {a;;} is any n x n matrix with 
entries in Q,, let ||A|| denote the sup-norm |All = max, ; |a,jlp- 

Any Q,-linear map from K to K, when written in terms of the basis 
{1,y, y?,...,y"" 1}, givesann x nmatrix with entries in Q p- Now let A denote 
the matrix of the Q,-linear map from K to K which is multiplication by y. 
(This is the type of matrix used before in our discussion of the three equivalent 
definitions of the norm of an element.) Then the matrix A’ is the matrix cor- 
responding to multiplication by y', and J + A is the matrix corresponding to 
multiplication by 1 + y. (More generally, the matrix P(A) corresponds to 
multiplication by the element P(y) for any polynomial P € Q,[ X ].) 

We claim that the sequence of real numbers {||A'‘|};=0,1,2,... is bounded 
above. Suppose the contrary. Then we can find a sequence j;, j = 1, 2, . 
such ae | A‘il| > 7. Let bs A‘, which is the maximum | |, of any ‘of 
the n? entries of A‘). Let B; be an entry of A‘) with maximum | |,; thus, 
|Bjlp = \|A‘/l| = 5;. Define the matrix B, = Aii/B,, i.e., divide all entries of Ais 
by B;. (Note that B; #0, since ||A'/|| > 7.) Then clearly |B; || = 1. Since the 
sup-norm unit ball is compact (Exercises 2 and 8 below), we can find a 
subsequence {B,, },=1,2,... which converges to some matrix B. Since det Bj = 
(det A‘’)/B?, we have 


|det B;|, < |det A¥|,/" = |Nxja,(7)"'lp/i” = ly lpr" S 1". 


By the definition of convergence in the sup-norm, each entry of B is the limit 
as k — co of the corresponding entry of B,,; hence det B = lim det B,, = 0. 

Because det B = 0, there exists a nonzero element | € K, considered as a 
vector written with respect to the basis {1, y, y?,..., y"~ 1}, such that B/ = 0. 
We now show that this implies that B is identically zero, contradicting 
|| B|| = 1 and hence proving the claim that {||A‘||} is bounded. 


62 


2 Extension of norms 


Since {y'l};-0,1,.....-1 iS a basis for the Q,-vector space K, it suffices to 
show that By'l = 0 for any i. But since multiplication by y' is given by the 
matrix A‘, we have By'] = (BA')I = A‘Bl = 0, where the relation BA' = A'B 
comes from the fact that B is a limit of matrices of the form B; = A‘/B,, 
i.e., scalar multiples of powers of A, and any such matrix B; commutes with 
A‘. This proves our claim that {||A‘||} is bounded by some constant C. 

Note that foranyn x nmatrix A = {a,,} we have: |det A|, < (max,,,|a,,|,)" 
= ||A||"; this is clear if we expand the determinant and use the additive and 
multiplicative properties of a non-Archimedean norm. 

Now let N be very large, and consider: (I+ A)X =I1+(4)A4-°°4+ 


(wy ,)AN71 + AN. We have 
I 


Hence |1 + y|, < WC. Letting N — o gives |1 + y|, < 1 as required. (Note 
the similarity with the proof of Ostrowski’s theorem in §1.2.) O 


[1 + ylp¥ = |det(Z + A)*|}" < (+ A) < ( max 


<i<N 


O<isN 


< ( max iA‘) <C. 


Let R be a (commutative) ring, i.e., a set R with two operations + and - 
which satisfy all the rules of a field except for the existence of multiplicative 
inverses. In other words, it’s an additive group under +; has associativity, 
identity, and commutativity under -; and has distributivity. R is called an 
integral domain if xy = 0 always implies x = 0 or y = 0. Z and Z, are 
examples of integral domains. 

A proper subset J of R is called an ideal if it is an additive subgroup of R 
and for all x €e Rand ae J we have: xa € J. In the ring Z, the set of all multiples 
of a fixed integer is an ideal. In Z,, for any r < 1 the set {x €Z,| |x|, < r}is 
an ideal. If, say, r = p~", this is the set of all p-adic integers whose first 
n + 1 digits are zero in the p-adic expansion. 

If J, and J, are ideals of R, then the set 

{x € R| x can be written as x = x\xX,' +--+ + XmXm’ With x,E1,, x; € 15} 
is easily checked to be an ideal, which is written I,J, and is called the product 
of the two ideals. An ideal J is called prime if: x,x2,¢J implies x, ¢J or 
x2 ET. 

It is easy to verify (see Exercise 5 below) that Z, has precisely one prime 
ideal, namely 


Ply {x EZ, | |xlp < 1}, 
and that all ideals of Z, are of the form 
ply det {xeZ, | |xlp < p~"}- 


If J is an ideal in a ring R, it is easy to see that the set of additive cosets 
x + I form a ring, called R/J. (Another way of describing this ring: the set of 
equivalence classes of elements of R with respect to the equivalence relation 
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x ~ yif x — ye) For example, if R = Z (or if R = Z,), the ring R/pR is 
the field F, of p elements, as we’ve seen. 

An ideal M in R is called maximal if there is no ideal strictly between M 
and R. It is an easy exercise to check that: 


(1) An ideal P is prime if and only if R/P is an integral domain. 
(2) An ideal M is maximal if and only if R/M is a field. 


Now let K be a finite extension field of Q,. (Or, more generally, let K be 
an algebraic extension of a field F which is the field of fractions of an integral 
domain R, e.g., F = Q is the field of fractions of R = Z, F = Q, is the field 
of fractions of R = Z,.) Let A be the set of all « € K which satisfy an equa- 
tion of the form x" +a,x""'+---+a,_,x+a,=0 with the a,€Z,. 
(Every «eK of course satisfies an equation of this form with coefficients 
in Q,, but usually not all the a; are in Z,.) A is called the “integral closure 
of Z, in K.” 

It is not hard to show that if « € A, then its monic irreducible polynomial 
has the above form. Moreover, the integral closure is always a ring. (For the 
general proof, see Lang’s Algebra, pp. 237—240; in the case we’ll be working 
with—the integral closure of Z, in K—we prove that it’s a ring in the proposi- 
tion that follows.) 


Proposition. Let K be a finite extension of Q, of degree n, and let 


A={xeK| |x|, < 1}, 
M = {xeK| |x|, < 1}. 


Then A is a ring, which is the integral closure of Z, in K. M is its unique 
maximal ideal, and A/M is a finite extension of F, of degree at most n. 


ProorF. It is easy to check that A is a ring and M is an ideal in A, using the 
additive and multiplicative properties of a non-Archimedean norm. Now let 
a € K have degree m over Q,, and suppose that a is integral over Z,: e™ + 
ae" 4.-.+4 4a, = 0,4a,€2Z,. If jal, > 1, we would have: 


Jol," = |o"|, = |aya"~2 +--+ + aplp S max |aja™~'|, 
Pp 
l<ism 


< max |e™'], = |alp~’, 
lsism 

a contradiction. Conversely, suppose |«|, < 1. Then all the conjugates of 
«=a, over Q, also have |a|, = []7-:[e,|/}" = ||, < 1. Since all the 
coefficients in the monic irreducible polynomial of « are sums or differences 
of products of «; (the so-called ‘‘symmetric polynomials” in the «,), it follows 
that these coefficients also have | jp < 1. Since they lie in Q,, they hence 
must lie in Z,. 

We now prove that M contains every ideal of A. Suppose «€ A, «o¢ M. 
Then |a|, = 1, so that |1/a|, = 1, and 1/a € A. Hence any ideai containing « 
must contain (1/a)-« = 1, which is impossible. 
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Note that M2 Z, = pZ, from the definition of M. 

Consider the field A/M. Recall that its elements are cosets a+ M. 
Notice that if a and b happen to be in Z,, then a + M is the same coset as 
b + Mifand only ifa — be MZ, = pZ,. Thus, there’s a natural inclusion 
of Z,/pZ, into A/M given by coset a + pZ,+> coset a + M for ae Z,. Since 
Z,/pZ,» is the field F, of p elements, this means that A/M is an extension field 
of F,. 

We now claim that A/M has finite degree over F,, in fact, that [A/M: F,] < 
[K: Q,]. If m = [K: Q,], we show that any n+ 1 elements 4), a, ..., 
G,.,€A/M must be linearly dependent over F,. For i = 1,2,...,” + 1, 
let a, be any element in A which maps to 4@, under the map A — A/M (ie., 
a, is any element in the coset 4, in other words: a, = a, + M). Since 
[K: Q,] = n, it follows that a,, do, ..., dn41 are linearly dependent over Q,: 


a,b, + aabe teeet Qn +10 n +1 — 0, be€ Q,. 


Multiplying through by a suitable power of p, we may assume that all the 
b,€ Z, but at least one b, is not in pZ,. Then the image of this expression in 
A/M is 

a,b, + aby ai Gn+rBna1 = 0, 
where 5, is the image of 5, in Z,/pZ, (i.e., 5, is the first digit in the p-adic 
expansion of 5,). Since at least one b, is not in pZ,, it follows that at least one 
5, is not 0, so that @,, da, ..., G,41 are linearly dependent, as claimed. im 


The field A/M is called the residue field of K. It’s a field extension of F, of 
some finite degree f. A itself is called the ‘‘valuation ring” of | |, in K. 


EXERCISES 


1. Prove that two vector space norms || ||; and || ||. ona finite dimensional vector 
space V are equivalent if and only if there exist c, > 0 and cg > 0 such that 


for allxe V: 
xl]2 < exl|x]]1 and xl < callxlle. 


2. Let F be a field with a norm || ||. Let V be a finite dimensional vector space 
over F with a basis {v, ..., v,}. Prove that |aiv. + --- + @nvallsup & Maxi cin 
(Ia;||) is a norm on V. Prove that if F is locally compact, then V is locally compact 
with respect to || |lsup- 

3. Let V = Q,(Vp), v1 = 1, v2 = vp. Show that the sup-norm is not a field 
norm on Q,(Vp). 

4. If V = K is a field, can the sup-norm ever be a field norm (for any basis 


{v,, ..., Unt) when 2 = dim K > 1? Discuss what type of finite extensions 
K of Q, can never have the sup-norm being a field norm. 


5. Prove that Z, has precisely one prime ideal, namely pZ,, and that all ideals in 
Z, are of the form p"Z,, ne {1, 2, 3, ...}. 


6. Prove that, if a vector space with a norm || |ly is locally compact, then it is 
complete. 
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7. Prove that a vector space with a norm || ||y is locally compact if and only if 
{x | [xllv < 1} is compact. 


8. Prove that, if a vector space with a norm || |ly is locally compact, then 
{x | xlly = 1} is compact. 


3. The algebraic closure of Q, 


Putting together the two theorems in §2, we conclude that | |, has a unique 
extension (which we also denote | |,) to any finite field extension of Q,. 
Since the algebraic closure @, of Q, is the union of such extensions, | |, 
extends uniquely to O,. Concretely speaking, if « € Q, has monic irreducible 
polynomial x" + a,x"~1+.---+a,, then |a|, = |a,|}!". 

Let K be an extension of Q, of degree n. For « € K* we define 


1 
ord, det —log, ||, = —log, INxia,(@)15" =e log, INxia,()l»- 


This agrees with the earlier definition of ord, when « € Q,, and clearly has 
the property that ord, «8 = ord, a + ord, 8. Note that the definition of 
ord, « is independent of the choice of field K such that a € K, [K: Q,] < 00. 
The image of K* under the ord, map is contained in (1/n)Z = {x e€ Q| 
nx € Z}. Since this image is a nontrivial additive subgroup of (1/n)Z that 
contains Z, it must be of the form (1/e)Z for some positive integer e dividing 
n. This integer e is called the “index of ramification” of K over Q,. Ife = 1, 
we say that K is an unramified extension of Q,. Now let x € K be any element 
such that ord, x = (1/e). Then clearly any x € K can be written uniquely in 
the form 


au, where |u|, = 1 and me Z (in fact, m = e- ord, x). 


It can be proved (Exercise 12 below) that n = e-f, where n = [K: Q,], 
e is the index of ramification, and f is the degree of the residue field A/M 
over F,. In any case, we’ve already seen that f < n and e < n. In the case of 
an unramified extension K. i.e., when e = 1, we may choose p itself for 7 
in the preceding paragraph, since ord, p = 1 = (l/e). At the other extreme, 
if e = n, the extension K is called totally ramified. 


Proposition. If K is totally ramified and m € K has the property ord, 7 = (1/e), 
then 7 satisfies an “‘ Eisenstein equation” (see Exercise 14 of §1.5) 
xX? + a,_\xX*-1+---+ a, =0, a,€Z,, 
where a, = 0 (mod p) for all i, and ay # 0 (mod p?). Conversely, if « is a 
root of such an Eisenstein equation over Q,, then Q,(«) is totally ramified 
over Q, of degree e. 


Proor. Since the a; are symmetric polynomials in the conjugates of 7, all of 
which have | |, = p~?/, it follows that |a,|, < 1. As for ao, we have |ao|, = 
|7|,° = I/p. 

Conversely, we saw in Exercise 14 §I.5 that an Eisenstein polynomial is 
irreducible, so that adjoining a root « gives us an extension of degree e. 
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Since ord, d) = 1, it follows that ord, « = (1/e) ord, a) = (1/e), and hence 
Q,(a) is totally ramified over Q,. oO 


A more precise description of the types of roots of polynomials that can 
be used to get a totally ramified extension of degree e can be given if e is not 
divisible by p (this case is called “‘tame”’ ramification; ple is called “wild” 
ramification). Namely, such tamely totally ramified extensions are obtained 
by adjoining solutions of the equation x*° — pu = 0, where ue Z,”, i.e., such 
extensions are always obtained by extracting an eth root of p times a p-adic 
unit (see Exercises 13 and 14 below). 

Now let K be any finite extension of Q,. The next proposition tells us that 
if K is unramified, i.e.,e = 1, then K must be of a very special type, namely, a 
field obtained by adjoining a root of 1; while if K is ramified, it can be 
obtained by first adjoining a suitable root of 1 to obtain its “‘maximal un- 
ramified subfield” and then adjoining to this subfield a root of an Eisenstein 
polynomial. Warning: the proof of the following proposition is slightly 
tedious, and the reader who is impatient to get to the meatier material in the 
next chapter may want to skip it (and also skip over some of the harder 
exercises in §III.4) on a first reading. 


Proposition. There is exactly one unramified extension K7"**™ of Q, of degree f, 
and it can be obtained by adjoining a primitive (p' — 1)th root of 1. If Kis an 
extension of Q, of degree n, index of ramification e, and residue field degree f 
(so that n = ef, as proved in Exercise 12 below), then K = Kj7""*™(r), 
where 7 satisfies an Eisenstein polynomial with coefficients in K7°™°™. 


ProoF. Let & be a generator of the multiplicative group Fy (see the proposition 
at the end of §1), and let P(x) = x! + @x'-1 +--- + d,,a,€ F,, be its monic 
irreducible polynomial over F, (see Exercise 5 of §1). For each i, let a, € Z, be 
any element which reduces to a, mod p, and let P(x) = xf + ayxf-2 +--+ 
a,. Clearly, P(x) is irreducible over Q,, since otherwise it could be written as a 
product of two polynomials with coefficients in Z,, and each could be 
reduced mod p to get P(x) as a product. Let « € Q2!**! be a root of P(x). Let 
K = Q,(a), A = {xe K| |x|, < 1}, M = {xe K| |x|, < 1}. Then [K: Q,] = 
f, while the coset « + M satisfies the degree f irreducible polynomial P(x) 
over F,. Hence [4/M: F,] = f, and K is an unramified extension of degree /. 
(We have not yet proved that it is the only one.) 

Now let K be as in the second part of the proposition. Let A= 
{xe R| |x],<1} be the valuation ring of ||, in K, and let M= 
{x € R| |x|, < 1} be the maximal ideal of A, so that A/M = F,,. Let @€ Fy 
be a generator of the multiplicative group Fx. Let a) €.A be any element 
that reduces to & mod M. Finally, let x ¢ K be any element with ord, x = 
l/e; thus, M = 7A. 

We claim that there exists « = a mod 7 such that o-1 — 1 = 0. The 
proof is a Hensel’s lemma type argument. Namely, we write a = a + a,7 
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(mod 7?), so that mod 7? we need 0 = (a + a7)”-!— 1 Sa -t— 1 + 
(pl — Voymab’-2 = ab’-! — 1 — ana?’ -2 (mod 7”). But «8-1 = 1 (mod 7), 
so that, if we set «, = («8’~! — 1)/(wa?’~?) (mod =), then we get the desired 
congruence mod 7”. Continuing in this way, just as in Hensel’s lemma, we 
find a solution « = a» + o,7 + agr? +--- to the equation «”-! = 1. 

Note that «,«?,...,a”-1 are all distinct, because their reductions 
mod M—a, @,...,a@”-1—are distinct. In other words, « is a primitive 
(p’ — 1)th root of 1. Also note that [Q,(«):Q,] = f, since f is the residue 
field degree of the extension. (We will soon prove that (O,(«):Q,] = f) 

The above discussion applies, in particular, to the field K constructed in 
the first paragraph of the proof. Hence K > Q,(«), where a is a primitive 
(p! — 1)th root of 1. Since f = [K:Q,] > [Q,(«):Q,] =f, it follows that 
K= Q,(«). Thus, the unramified extension of degree f is unique. Call it 
Kunan 

‘We now return to our field K of degree n = ef over Q,. Let E(x) be the 
monic irreducible polynomial of 7 over K = K?""*™. Let {n,} be the 
conjugates of 7 over K7""*™, so that E(x) = [] (x — 7;). Let d be the degree 
and c the constant term of E(x). Then ord, c = dord, 7 = d/e. But since 
ef =n = [K:Q,] = [K: Kp2"*™)[ Ky": Q,] = [K: K72"°™]-f, it follows that 
d < e. Since ce K¥"™*™, ord, c is an integer. We conclude that d = e, and 
ord, c = 1. Thus, E(x) is an Eisenstein polynomial, and K = K¥"™™(7). 


Corollary. If K is a finite extension of Q, of degree n, index of ramification e, 
and residue field degree f, and if 7 chosen so that ord, 7 = 1/e, then every 
a € K can be written in one and only one way as 


ao 
>, an 
i=m 


where m = e ord, «and each a; satisfies a,°' = a, (i.e., the a,’s are Teich- 
miiller digits). 


The proof of the corollary is easy, and will be left to the reader. 


If m is any positive integer not divisible by p, we can find a power p’ of p 
which is congruent to 1 mod m (namely, let f be the order of p in the multipli- 
cative group (Z/mZ)” of residue classes mod m of integers prime to m). Then, 
if p’ — 1 = mm’, and if we adjoin to Q, a primitive (p’ — 1)th root a of 1, 
it follows that «”’ is a primitive mth root of 1. Hence, we may conclude that 
finite unramified extensions of Q, are precisely the extensions obtained by 
adjoining roots of \ of order not divisible by p. 

The union of all the finite unramified extensions of Q, is written Q>"™°™ 
and is called the ‘maximal unramified extension of @,.” The ring of integers 
Zanram of GF (also called the ‘valuation ring”’), which is 


Zenram & {xe€ qyusrem | |x| p < 1}, 
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has a (unique) maximal ideal M"""™ = pZperem = {x © Opre™ | |x|, < 1} = 
{x © Qypnram | |x|, < 1/p}. The residue field Zp"™*™/pZ}4"*™ is easily seen to 
be the algebraic closure F, of F,. Every x¢F, has a unique “Teichmiiller 
representative” x € Zj"™™ which is a root of 1 and has image x in Z3"™9™/ 
pzs""*™. For this reason, Z5™™*™ is often called the “lifting to characteristic 
zero of F,” (also called the ‘Witt vectors of F,”’). 

Querem which is a much smaller field than Q2'**, can be used instead of 
Qs'*° in many situations. 

The “opposite” of unramified extensions is totally ramified extensions. 
We can get a totally ramified extension, for example, by adjoining a primitive 
pth root of 1—this will give us a totally ramified extension of degree 
n =e = p’"\(p — 1) (see Exercise 7 below). However, unlike in the un- 
ramified case, not by a long shot can all totally ramified extensions be ob- 
tained by adjoining roots of 1. For example, adjoining a root of x" — p 
clearly gives a totally ramified extension K of degree m; but if K were contained 
in the field obtained by adjoining a primitive p’th root of 1, we would have 
m|p’-1(p — 1), which is impossible if, say, m > p and ptm. About all we 
can say about the set of all totally ramified extensions of @, is contained 
in the proposition at the beginning of this section and in Exercise 14 
below. 

We repeat: An extension K of Q, of degree n, index of ramification e, and 
residue field degree f is obtained by adjoining a primitive (p‘ — 1)th root 
of | and then adjoining to the resulting field KY°"*™ a root of an Eisenstein 
polynomial with coefficients in KY2"™*™, 

We conclude this section with a couple of useful propositions. 


Proposition (Krasner’s Lemma). Let a, b « Q, (=Q3**), and assume that b 
is chosen closer to a than all conjugates a, of a (a, # a), i.e., 


|b — aly < |a, — aly. 
Then Q,(a) < Q,(8). 
Proor. Let K = Q,(b), and suppose a¢ K. Then, since a has conjugates 
over K equal in number to [K(a):K], which is >1, it follows that there is at 
least one a,¢ K, a; # a, and there is an isomorphism o of K(a) to K(a;) 
which keeps K fixed and takes a to a,. We already know, because of the 
uniqueness of the extension of norms, that |ox|, = |x|, for all x € K(a). In 
particular, |b — a,|, = |ob — cal, = |b — al,, and hence 
la, — al, < max(|a, — 5|,, |b — al,) = |b — aly < |a, — aly, 


a contradiction. O 


Note that Krasner’s Lemma can be proved in exactly the same way in a 
more general situation: If a,be @,, K is a finite extension of Q,, and for all 
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conjugates a, of a over K (a, # a) we have |b — al, < |a, — aly, then K(a) ¢ 
K(d). 

Now let K be any field with a norm || ||. If 4 g¢ K[X], ie, f = > aX! 
and g = > 6,X' are two polynomials with coefficients in K, we define the 
distance || f — g|| from f to g as 


lf - gl Ger MAX; la; — 5, |. 


Proposition. Let K be a finite extension of Q,. Let f{(X) € K[X] have degree n 
F(X) = a,X" + aye X"71 +--+ + aX + ap. 


Suppose the roots of f in Q, are distinct. Then for every « > 0 there exists a& 
such that, if g = dt. b,X' € K[X] has degree n, and if |f — g\p < 4, then 
for every root «, of f(X) there is precisely one root B, of g(X) such that 
| oe, oes Bilp <eé. 


Proor. For each root B of g(X) we have 


le = LB) — elo = » (a — b)B'l, 
< max(|4, a bilp |Blp') 
< \f- 8lp max(I, (Blp") < 6C,", 


where C, is a suitable constant (see Exercise 3 below). 

Let C, = min, <;<j <n |; — %;|p- Since the a,’s are distinct, we have C, #0. 
Then the relation |B — a|, < Cy, is only possible for at most one a; (since if it 
held for another «; # a, we’d have |a, — a,|, < max(|«, — Bl,, |B — |p) < 
C,). Since 


C,"3 > |f(B)|> 
= |a,[ [(B — |p (since f(X) = a, [ | (X — )) 
= lanl>] | |B = ap, 


it’s clear that for 8 sufficiently small such an a with |B — |, < C, must 
exist. Moreover, for that «, we have: 


2-4 
lanl> I I |B = a,|p 
S# 


C,"8 
< 
~ |@n|p- C37? 


which can be made <e by a suitable choice of 6. O 
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So far we’ve been dealing only with algebraic extensions of Q,. But, as 
mentioned before, this is not yet enough to give us the p-adic analogue of the 
complex numbers. 


Theorem 12. Q, is not complete. 


Proor. We must give an example of a Cauchy sentence {a,;} in @, such that 
there cannot exist a number a € Q, which is the limit of the q,. 

Let b; be a primitive (p? — 1)th root of 1 in G,, ie., bP? -1 = 1, but 
by" #4 Lifm < p* — 1. Note that b?”"-! = 1 if i’ > i, because 2'|2" implies 
p? — 1 |p” — 1. (In fact, instead of 2' we could replace the exponent of p 
by any increasing sequence whose ith term divides its (i + 1)th, e.g., 3‘, i!, 
etc.) Thus, if i’ > i, 5, is a power of 5. Let 


1 
Qa = > b,p™), 
J=0 


where 0 = Ng < N, < N2 <.--- is an increasing sequence of nonnegative 
integers that will be chosen later. Note that the b;,j = 0,1, ..., i, are the 
digits in the p-adic expansion of a, in the unramified extension Q,(5,), since 
the 5, are Teichmiiller representatives. Clearly {a,} is Cauchy. 

We now choose the N,, j > 0, by induction. Suppose we have defined N, 
for j < i, so that we have our a, = >{.9b,;p"s. Let K = Q,(6,). In §3 we 
proved that K is a Galois unramified extension of degree 2'. First note that 
Q,(a;) = K, because otherwise there would be a nontrivial Q,-automorphism 
o of K which leaves a; fixed (see paragraph (11) in §1). But o(a;) has p-adic 
expansion L}_ 9 o(b;)p™, and o(b;) # b;, so that o(a;) # a, because they have 
different p-adic expansions. 

Next, by exercise 9 of §III.1, there exists N,,, > N, such that a, does not 
satisfy any congruence 


Ons” + oy yap-> +--+ + aya, + a = 0 (mod p™+1) 


for n < 2! and a, eZ, not all divisible by p. 
This gives us our sequence {a,}. 
Suppose that ae Q, were a limit of {a,}. Then a satisfies an equation 


a,a" + a,_;a"- 1 +--+ +4 aa + a = 0, 


where we may assume that all of the a, € Z, and not all are divisible by p. 
Choose i so that 2‘ > n. Since a = a, (mod p™:+1), we have 


OQ” + Oy QP > + +++ + aa; + & = O(mod p™+1), 


a contradiction. This proves the theorem. oO 
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Note that we have actually proved that Q3""*™, not only @, = Q3!**, is 
not complete. 

So we now want to “‘fill in the holes,” and define a new field Q to be the 
completion of @,. Strictly speaking, this means looking at equivalence classes 
of Cauchy sequences of elements in @, and proceeding in exactly the same 
way as how Q, was constructed from @ (or how R was constructed from Q, 
or how a completion can be constructed for any metric space). Intuitively 
speaking, we’re creating a new field by throwing in all numbers which are 
convergent infinite sums of numbers in Q,, for example, of the type considered 
in the proof of Theorem 12. 

Just as in going from Q to Q,, in going from Q, to Q we can extend the 
norm | |, on @, to a norm on Q be defining |x|, = limy..|x;|,, where {x} 
is a Cauchy sequence of elements in @, that is in the equivalence class of x 
(see §1.4). As in going from to Q,, it is easy to see that if x # 0 this limit 
|x|, is actually equal to |x,|, for i sufficiently large. 

We also extend ord, to Q: 


ord, x = —log,|x|». 


Let A = {xEQ||x|, < 1} be the “valuation ring” of Q, let M= 
{x €Q||x|, < 1} be its maximal ideal, and let A* = {x€Q||x|, = 1} = 
A—M be the set of invertible elements of A. Suppose that xe A%, ie, 
|x|, = 1. Since @, is dense in Q, we can find an algebraic x’ such that 
x —x’eM, ie. |x — x’|, < 1. Since then |x’|, = 1, it follows that x’ is 
integral over Z,, i.e., it satisfies a monic polynomial with coefficients in Z,. 
Reducing that polynomial modulo p, we find that the cosetx + M=x'+M 
is algebraic over F,, i.¢., lies in some Fs. Now let a(x) be the (p’ — 1)th root 
of 1 which is the Teichmiiller representative of x + MeF,,, and set <x) = 
x/a(x). Then <x> €1 + M. In other words, any x € A” is the product of a 
root of unity w(x) and an element <x) which is in the open unit disc about 1. 
(If x € Z, has first digit ay, this simply says that x is the product of the Teich- 
miiller representative of ay and an element of 1 + pZ,,.) Finally, an arbitrary 
nonzero x €Q can be written as a fractional power of p times an element 
x,€Q of absolute value 1. Namely, if ord, x = r = a/b (see Exercise 1 
below), then let p” denote any root in Q, of the polynomial X° — p*. Then 
x = p’x, = p'w(x,)<x,> for some x, of norm 1. In other words, any nonzero 
element of Q is a product of a fractional power of p, a root of unity, and an 
element in the open unit disc about \. 

The next theorem tells us that we are done: will serve as the p-adic 
analogue of the complex numbers. 


Theorem 13. Q is algebraically closed. 


Proor. Let: f(X) = X" + a,_,X"7} +--+ + aX + a, a,€Q. We must 
show that f(X) has a root in Q. For each i = 0,1, ...,m — 1, let {a,;}, be a 
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sequence of elements of @, which converge to a, Let g,X) = X" + 
Qn —1gX"7 2 + +++ + a jX + ao; Let r;; be the roots of g,(X) @ = 1,2, ..., 
n). We claim that we can find i; (1 < i; < n) for j = 1, 2,3, ... such that 
the sequence {r,, ;} is Cauchy. Namely, suppose we have r,, ; and we want to 
find ri, ,,,;41- Let 8; = |g; — gy4ilp = max,(|a;,; — a;,;+1|p) (which approaches 
0 as j > 00). Let A; = max(I, |r;,,;|>"). Clearly there is a uniform constant A 
such that A; < A for all j (see Exercise 3 below). Then we have 


I] Figs — Tijsilp = lgj+ilriy. dp 


[sis as BATA |p 
< 8,A. 


Hence at least one of the |r; — 1,541], on the left is < V8,A. Let ri,,,;+1 be 
any such r,,;41- Clearly this sequence of r,,; is Cauchy. 

Now let r = lim, 7,,;€ Q. Then f(r) = limj.0 /(ri,,;) = lim;+ 0 gri,,;) 
= 0. oO 


Summarizing Chapters I and III, we can say that we have constructed Q, 
which is the smallest field which contains © and is both algebraically closed 
and complete with respect to | |,. (Strictly speaking, this can be seen as 
follows: let Q’ be any such field; since ’ is complete, it must contain a field 
isomorphic to the p-adic completion of Q, which we can call Q,; then, since 
Q’ contains Q, and is algebraically closed, it must contain a field isomorphic 
to the algebraic closure of Q,, which we can call Q,; and, since ’ contains 
@, and is complete, it must contain a field isomorphic to the completion of 
Q,, which we call Q. Thus any field with these properties must contain a 
field isomorphic to 2. The point is that both completion and algebraic 
closure are unique processes up to isomorphism.) 

Actually, Q should be denoted Q,, so as to remind us that everything we’re 
doing depends on the prime number p we fixed at the start. But for brevity of 
notation we shall omit the subscript p. 

The field Q is a beautiful, gigantic realm, in which p-adic analysis lives. 


EXERCISES 


1. Prove that the possible values of | |, on Ox is the set of all rational powers 
of p (in the positive real numbers). What about on 2? Recall that we let the 
ord, function extend to Q” by defining ord, x = —log, |x|, (i.e., the power 
I/p is raised to get |x|,). What is the set of all possible values of ord, on Q*? 
Now prove that @, and © are not locally compact. This is one striking 
difference with C, which is locally compact under the Archimedean metric 
(the usual definition of distance on the complex plane). 
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What happens if you define an “ellipse” in Q to be the set of points the sum 
of whose distance from two fixed points a, b € Q is a fixed real number r? 
Show that this “‘ellipse”’ is either two disjoint circles, the intersection of two 
circles, or the empty set, depending on a, b, and r. What do you get if you 
define a “‘hyperbola” as {xe | |x -— al, — |x — bl, =r}? 


. Let g(X) = X" + bX") +--+ + bX + bo. Let Cy = max;|b;|,. Show that 


there exists a constant C, depending only on Cy such that any root B of g(X) 
satisfies |B], < Cy. 


Let a be a root of a monic irreducible polynomial f(X) € K[X], where K is a finite 
extension of Q,. Prove that there exists an e > 0 such that any polynomial g(X) 
having the same degree as f and satisfying | f — g|, < ¢ has a root B such that 
K(a) = K(f). Show that this is not necessarily the case if fis not irreducible. 


. Prove that any finite extension K of @, contains a finite extension F of the 


rational numbers @ such that [F: Q] = [K: Q,] and F is dense in K, i.e., 
for any element x € K and any e > 0 there exists y € F such that |x — yl, < e«. 


Let p be a prime such that — 1 does not have a square root in Q, (see Exercise 
8 of §III.1). Use Krasner’s Lemma to find an e such that Q,(V—a) = 
Q,(V —1) whenever |a — 1|, < e. For what e¢ does |a — p|, < e imply 
Q,(Va) = Q,(V p)? (Treat the case p = 2 separately.) 


Let a be a primitive p’th root of 1 in G,, ie, a? # 1. Find |a — 1],. In the case 
n= | show that Q,(a) = Q,() where £ is any root in Q, of X?~* + p = 0. Also 
show that |a — 1|, = 1 if ais a primitive mth root of 1 and mis not a power of p. 


. Let K be a finite extension of Q,. Let m be a positive integer, and let (K*)” 


denote the set of all mth powers of elements of K *. Suppose that (1) |m|, = 1, 
and (2) K contains no mth roots of 1 other than 1 itself. (For example, if 
K = Q,, these two conditions both hold if and only if m is relatively prime 
to both p and p — 1, as you can prove as an exercise.) Prove that the index 
of (K*)" as a multiplicative subgroup in K* (i.e., the number of distinct 
cosets) is equal to m. 


. If in the previous exercise we remove the assumption that K contains no nontrivial 


m-th roots of 1, show that the index of (K *)” in K * equals mw, where w is the number 
of m-th roots of | contained in K. 


. If K is a totally ramified extension of Q,, show that every mth root of 1 in K 


is in Q, if p does not divide m. 


. Determine the cardinality of the sets Q,, @,, and Q. 


. Prove that ef = n, where n = [K: Q,], e is the index of ramification, and f 


is the residue field degree. 


Let K be a totally ramified extension of Q, of degree e. Show that there exists 
Be K such that |B* — «|, < 1/p for some « € Z, with ord, a = 1. 


Suppose K is tamely totally ramified. Using a Hensel’s lemma type argument, 
show that f can be further adjusted so that B* € Qy, i.e., B satisfies X¥° — « = 
0, where « € Z, and ord, a = 1. Note that K = Q,(8) (explain why). 


The complex numbers C are much more numerous than the rational numbers, 
or even the algebraic numbers, because the latter sets are only countably 
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infinite, while C has the cardinality of the continuum. Q is also much, much 
bigger than Q?!8*!, although not in precisely that way (see Exercise 11 above). 
Prove that there does not exist a countably infinite set of elements of Q such 
that Q is an algebraic extension of the field obtained by adjoining all those 
elements to Q, (i.e., the field of all rational expressions involving those 
elements and elements of Q,). One says that Q has “uncountably infinite 


transcendence degree over Q p: (Warning: this exercise and the next are hard!) 


Does Q have countably infinite transcendence degree over the p-adic comple- 
tion of Qi"? 
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CHAPTER IV 


p-adic power series 


1. Elementary functions 


Recall that in a metric space whose metric comes from a non-Archimedean 
norm || ||, a sequence is Cauchy if and only if the difference between adjacent 
terms approaches zero; and if the metric space is complete, an infinite sum 
converges if and only if its general term approaches zero. So if we consider 
expressions of the form 


S(X) = > aX, aeQ, 
n=0 


we can give a value >. a,x" to f(x) whenever an x is substituted for X for 
which |a,x"|, — 0. 
Just as in the Archimedean case (power series over R or C), we define the 
“radius of convergence” 
r= roma Sees ? 
lim sup|a,|}" 
where the terminology “‘1/r = lim sup|a,|}/"” means that 1/r is the least real 
number such that for any C > 1/r there are only finitely many |a,|}’" greater 
than C. Equivalently, 1/r is the greatest “point of accumulation,” i.e., the 
greatest real number which can occur as the limit of a subsequence of 
{|a,|3'"}. If, for example, lim,...|a,|}’" exists, then 1/r is simply this limit. 
We justify the use of the term “radius of convergence” by showing that 
the series converges if |x|, < rand diverges if |x|, > r. First, if |x|, < r, then, 
letting |x|, = (1 — «)r, we have: |a,x"|, = (rla,|3")"(1 — ¢)". Since there are 
only finitely many » for which |a,|}!/" > 1/(r — 4er), we have 


sd els cin EME ct =z)’ - 
lim asaly = tim (F=H5,)" = jim (Fp)" = 0 
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Similarly, we easily see that if |x|, > r, then a,x" does not approach 0asn —> 00. 
Finally, if r = 00 it is easy to check that lim, ,, |a,x"|, = 0 for all x. 

What if |x|, = r? In the Archimedean case the story on the boundary of 
the interval or disc of convergence can be a little complicated. For example, 
log(] + x) = 39.1 (—1)"*'!x"/n has radius of convergence !. When |x| = 1, 


it diverges for x = — 1 and converges (‘‘conditionally,”’ not ‘absolutely ’’) for 
other values of x (i.e., for x = | in the case of the reals and on the unit circle 
minus the point x = —1 in the case of the complexes). 


But in the non-Archimedean case there’s a single answer for all points 
|x|, = r. This is because a series converges if and only if its terms approach 
zero, i.e., if and only if |a,|,|x|% > 0, and this depends only on the norm |x|, 
and not on the particular value of x with a given norm—there’s no such thing 
as “conditional” convergence (> + a, converging or diverging depending on 
the choices of +’s). 

If we take the same example >2., (—1)"*'X*/n, we find that |a,|, = 
pees”, and lim,...{a,|}’" = 1. The series converges for |x|, < 1 and diverges 
for |x|, > 1. If |x|, = 1, then |a,x"|, = ps" > 1, and the series diverges 
for all such x. 

Now let’s introduce some notation. If R is a ring, we let R{[X]] be the ring 
of formal power series in X with coefficients in R, i.e., expressions >= 0@,X", 
a, € R, which add and multiply together in the usual way. For us, R will 
usually be Z, Q, Z,, Q,, or Q. We often abbreviate other sets using this 
notation, for example, 


1+ XRILX]] = (Ye R[[X]] | constant term dp of fis I}. 


der 
We define the “‘closed disc of radius re R about a point a é€ Q”’ to be 
DAr) =, {x €Q| |x — al, < 7}, 


and we define the “‘open disc of radius r about a” to be 


DAr-) = {xe Q| |x — aly < r}. 


We let D(r) =, Do(r) and D(r~) = Do(r~). (Note: whenever we refer to the 


closed disc D(r) in Q, we understand r to be a possible value of | px 1.58 
positive real number that is a rational power of p; we always write D(r7) if 
there are no x € Q with |x|, =r.) 

(A word of caution. The terms “closed” and “‘open”’ are used only out of 
analogy with the Archimedean case. From a topological point of view the 
terminology is bad. Namely, for positive c the set C, = {x €Q| |x — al, = c} 
is open in the topological sense, because every point x € C, has a disc about 
it, for example D,(c~ ), all points of which belong to C,. But then any union 
of C,’s is open. Both D,(r) and D,(r7), as well as their complements, are such 
unions: for example, D,(r~) = (),<,C,. Hence both D,(r) and D,(r~) are 
simultaneously open and closed sets. The term for this peculiar state of affairs 
in Q is “totally disconnected topological space. ”’) 

Just to get used to the notation, we prove a trivial lemma. 
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Lemma 1. Every f(X) € Z,[[X]] converges in D(1~). 
Proor. Let f(X) = d7-0 GnX", 4,€Z,y, and let xe D(1~). Thus, |x|, < 1. 
Also |a,|, < 1 for all n. Hence |a,x"|, < |x|,"—>0asn—>o. im 


Another easy lemma is 


Lemma 2. Every f(X) = dre0dnX” € Q[LX]] which converges in an (open or 
closed) disc D = D(r) or D(r~) is continuous on D. 


Proor. Suppose |x’ — x|, < 5, where 5 < |x|, will be chosen later. Then 
|x|» = |x|). (We are assuming x 4 0; the case x = 0 is very easy to check 
separately.) We have 


f(x) — SO)» 


eo 
> (a,x" — a,x'") 
n=0 Dp 


lA 


max,|a,x" — a,x'"|, 


max,(|a,|p|(x — x’)x"~* + x™~9x’ +--- 


+ xx'n~ 2 4 x'A-11 2), 


But [x"™-2 + x-2x! +e xx’ 2 + x’ 1S < max, cicn|x" x], = 
|x|$~+. Hence 


If) — fp S max,(|x — x']p|an]1x12~*) 


3 n 
< Tey m%a(l@nlol +l ): 
Since |a,|,|x|," is bounded as n—~ oo, this |f(x) — f(x’)|, is <e for suit- 
able 8. Oo 


Now let’s return to our series >.7- ,(— 1)"*1.X"/n, which, as we’ve seen, has 
disc of convergence D(1~). That is, this series gives a function on D(1~) 
taking values in Q. Let’s call this function log,(1 + X), where the subscript p 
reminds us of the prime which gave us the norm on Q used to get Q, and 
also reminds us not to confuse this function with the classical log(1 + X) 
function—which has a different domain (a subset of R or C) and range 
(R or C). Unfortunately, the notation log, for the ‘“‘p-adic logarithm”’ is 
identical to classical notation for “‘log to the base p.”” From now on, we shall 
assume that log, means p-adic logarithm 


log,(1 + X):D(1-) >, — log,(1 + x) = > (—1)"*1x"/n, 
n=1 
unless explicitly stated otherwise. 


The dangers of confusing Archimedean and p-adic functions will be 
illustrated below, and also in Exercises 8-10 at the end of §1. 
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Anyone who has studied differential equations (and many who haven’t) 
realize that exp(x) = e* = Dy x"/n! is about the most important function 
there is in classical mathematics. So let’s look at the series >¥-> X"/n! 
p-adically. The classical exponential series converges everywhere, thanks to 
the n! in the denominator. But while big denominators are good things to 
have classically, they are not so good p-adically. Namely, it’s not hard to 
compute (see Exercise 14 §I.2) 


Sn 


ord,(n!) = ai (S, = sum of digits in 1 to base p); 


-1 


|1/n!|, = po Seip 1), 
Our formula for the radius of convergence r = 1/(lim supja,|}’") gives us 
ord, r = lim inf Lorde a,}> 
n 


(where the “‘lim inf” of a sequence is its smallest point of accumulation). In 
the case a, = 1/n!, this gives 
ord, r = lim int (-7-—5+), 

but lim,..(—(@ — S,)/(n(p — 1))) = —1/(p — 1). Hence >3_, x"/n! ‘con- 
verges if |x|, < p~1/®-» and diverges if |x|, > p~@-». What if |x|, = 
p V@-, Le., ord, x = 1/(p — 1)? In that case 

ord,(a,x") = a a = i= a 1 
If, say, we choose n = p™ to be a power of p, so that S, = 1, we have: 
ord,(apnx?") = 1/(p — 1), |apmx?"|, = p7~/-», and hence a,x" + 0 as n—> 
oo. Thus, 57> X"/n! has disc of convergence D(p~1?~»-) (the — denoting 
the open disc, as usual). Let’s denote exp,(X) = dr=o X"/n! € Q,[[X]]. 

Note that D(p~?/?-?-) < D(I-), so that exp, converges in a smaller 
disc than log,! 

While it is important to avoid confusion between log and exp and log, 
and exp,, we can carry over some basic properties of log and exp to the p-adic 
case. For example, let’s try to get the basic property of log that log of a 
product equals the sum of the logs. Note that if x e D(|~) and y € D(1-), then 
also (1+ xi +y)=14+ (@+y+4+xy)e1 + D(I-). Thus, we have: 


loggl(l + 291 + y= > Ia + y+ ayn 


Meanwhile, we have the following relation in the ring of power series over Q 
in two indeterminates (written Q[[X, Y])): 


> (=) Xn + D(H en = DS (HK + V+ XY/)Yn. 
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This holds because over R or C we have log(1 + x)(1 + y) = log(1] + x) + 
log(1 + y), so that the difference between the two sides of the above equality— 
call it F(X, Y)—must vanish for all real values of X¥ and Y in the interval 
(-1, 1). So the coefficient of X¥" Y" in F(X, Y) must vanish for all m and n. 

The argument for why F(X, Y) vanishes as a formal power series is 
typical of a line of reasoning we shall often need. Suppose that an expression 
involving some power series in X and Y—e.g., log(1 + X), log(l1 + Y), and 
log(1 + X + Y + XY)—-vanishes whenever real values in some interval are 
substituted for the variables. Then when we gather together all ¥" Y*-terms in 
this expression, its coefficient must always be zero. Since this is a general fact 
unrelated to p-adic numbers, we won’t digress to prove it carefully here. 
But if you have any doubts about whether you could prove this fact, turn to 
Exercise 21 below for further explanations and hints on how to prove it. 

Returning to the p-adic situation, we note that if a series converges in Q, 
its terms can be rearranged in any order, and the resulting series converges 
to the same limit. (This is easy to:check—it’s related to there being no such 
thing as “conditional” convergence.) Thus, log,[(1 + x)(1 + y)] = S%.1 
(-—1)"***(x + y + xy)"/n can be written as Dmn-0Cmnx"y™. But the 
“formal identity” in Q[[X, Y]] tells us that the rational numbers c,,, will be 
O unless m = 0 or m = 0, in which case: Con = Cao = (—1)"**/n (Coo = 0). 
In other words, we may conclude that 


log,{(l + aX + 9) =D rain +S (Dy 


= log,(1 + x) + log,(1 + y). 


As a corollary of this formula, take the case when 1 + xis a p™th root of 1. 
Then |x|, < 1 (see Exercise 7 of §III.4), so that: p"™ log,(1 + x) = log, 
(1 + x)?" = log, 1 = 0. Hence log,(1 + x) = 0. 

In exactly the same way we can prove the familiar rule for exp in the p-adic 
situation: if x, ye D(p-/@-»-), then x + ye D(p-/@-)-), and exp,(x + y) 
= EXP, X- EXPy J. 

Moreover, we also find a result analogous to the Archimedean case as far 
as log, and exp, being inverse functions of one another. More precisely, 
suppose x € D(p~1/?-»-), Then exp, x = 1 + >%., x"/n!, and ord,(x"/n!) > 
n{(p — 1) — (» — S,)/(p — 1) = S,/( — 1) > 0. Thus, exp, x — 1 € D(I>). 
Suppose we take 

log,(1 + exp x — 1) = > (-1)"**(exp, x — 1)"/n 


n=1 


=> (=9( > jm!) [n 
n=1 mel 
But this series can be rearranged to get a series of the form D7; c,x". And 
reasoning as before, we have the following formal identity over Q[[X, Y]]: 
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coming from the fact that log(exp x) = x over R or C. Hence c, = 1, c, = 0 
for n > 1, and 


log,(1 + exppx — 1) =x forxe D(p-'-»-), 


To go the other way—i.e., exp,(log,(1 + x))—we have to be a little 
careful, because even if x is in the region of convergence D(1-) of log,(1 + X), 
it is not necessarily the case that log,(1 + x) is in the region of convergence 
D(p-"'-»-) of exp, X. This is the case if x € D(p~/~—), since then for 
noi: 


1 n 1 n—I1 
n — eee ee ee ee ne Oe 
(ord, x"/n) ag > oot ord, n = = ord, ”, 
which has its minima at n = 1 and n = p, where it’s zero. Thus, ord, log, 
(1 + x) > min, ord, x"/n > 1/(p — 1). Then everything goes through as 
before, and we have: 


exp,(log,(1 + x))=1+-x for xe D(p-¥?-)-), 


All of the facts we have proved about log, and exp, can be stated succinctly 
in the following way. 


Proposition. The functions log, and exp, give mutually inverse isomorphisms 
between the multiplicative group of the open disc of radius p~*-» about | 
and the additive group of the open disc of radius p~*"®-» about 0. 
(This means precisely the following: log, gives a one-to-one correspondence 
between the two sets, under which the image of the product of two numbers 
is the sum of the images, and exp, is the inverse map.) 


This isomorphism is analogous to the real case, where log and exp give 
mutually inverse isomorphisms between the multiplicative group of positive 
real numbers and the additive group of all real numbers. 

In particular, this proposition says that log, is injective on D,(p~-?-), 
ie., no two numbers in D,(p~!/?-»-) have the same log,. It’s easy to see 
that D,(p~!?~»-) is the biggest disc on which this is true: namely, a primitive 
pth root ¢ of I has |¢ — 1|, = p-?-» (see Exercise 7 of §II1.4), and also 
log, ¢ = 0 = log, |. 

We can similarly define the functions 


sinp: D(p7?-)-) > Q, sinp X = > (-—1)?X7"*1/(2n + 13; 
n=0 
cos,: D(p-"?-Y-) > Q, cos, X = SY (—1)"X2"/(2n)!. 
n=0 
Another function which is important in classical mathematics is the 
binomial expansion B,(x) = (1 + x)* = S®.,a(a — 1)---(a—n + I)/n! x". 


For any ae R or C, this series converges in R or C if |x| < 1 and diverges 
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if |x| > 1 (unless a is a nonnegative integer); its behavior at |x| = 1 is a 
little complicated, and depends on the value of a. 
Now for any aéQ let’s define 


BAX) =, S a(a — DG —n+1) 


n=0 
and proceed to investigate its convergence. First of all, suppose |a|, > 1. Then 
|a — i|, = |a|,, and the nth term has | |, equal to |ax|,"/|n!|,. Thus, for 
|a|, > 1, the series B,,,(X) has region of convergence D((p~1~?)/|a|,~). 

Now suppose |a|, < 1. The picture becomes more complicated, and 
depends on a. We won’t derive a complete answer. In any case, for any such a 
we have |a — i|, < 1, and so |a(a — 1)---(a —n + 1)/n!x"|, < |x"/n!|,, so 
that at least B,,,(X) converges on D(p~ ¥@~P-). 

We'll soon need a more accurate result about the convergence of B,.,(X) 
in the case when ae Z,. We claim that then B,,,(X) € Z,[[X]] (and, in particu- 
lar, it converges on D(i-) by Lemma 1). Thus, we want to show that 
ala — 1)---(a—n+ 1)/n!eZ,. Let ay be a positive integer greater than n 
such that ord,(a—a)>N (N_ will be chosen later). Then 
Ao(ay — 1)--- (29 —n + I)/n! = (0)E ZC Z,. It now suffices to show 
that for suitable N the difference between a(ay — 1)---(@9 — 2 + 1)/n! and 
ala — 1)---(a—n+ 1)/n! has | |, < 1. But this follows because the 
polynomial X(X¥ — 1)---(X — n + 1) is continuous. Thus, 

B,, A(X) € Z,[[X]] if a € Z,. 

As an important example of the case a€Z,, suppose that a = 1/m, 
meZ, ptm. Let x € D(I~). Then it follows by the same argument as used to 
prove log,(1 + x)(1 + y) = log,(1 + x) + log,(1 + y) that we have 

[Bijm,p(x)]™ = 1 + x. 
Thus, Bym.p(x) is an mth root of | + x in Q. (If p|m, this still holds, but now 
we can only substitute values of x in D(|m|,p~?-»-).) So, whenever a is an 
ordinary rational number we can adopt the shorthand: B,,,(X) = (1 + X)?*. 

But be careful! What about the following “paradox”? Consider 4/3 = 
(1 + 7/9)2; in Z, we have ord, 7/9 = 1, and so for x = 7/9 andn > 1: 

1/20/2 — 1)---(/2 —n+ I) An 
n! 


Xx", 


<77-"/|n!|7 < 1. 
7 


Hence 
1 > |(1 + 3)? — Uy = [$ — Ne = [ly = 1. 
What’s wrong?? 

Well, we were sloppy when we wrote 4/3 = (1 + 7/9)!/*. In both R and 
Q, the number [6/9 has two square roots + 4/3. In R, the series for (1 + 7/9)'? 
converges to 4/3, i.e., the positive value is favored. But in Q,, the square 
root congruent to | mod 7, i.e., —4/3 = 1 — 7/3, is favored. Thus, the exact 
same series of rational numbers 

S 1/20/2 — 1)---Ci/2 —n+ 1) (3) 
n=0 ! 9 


n 
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converges to a rational number both 7-adically and in the Archimedean 
absolute value; but the rational numbers it converges to are different! This is 
a counterexample to the following false “theorem.” 


Non-theorem 1. Let 5°, a, be a sum of rational numbers which converges to a 
rational number in | |, and also converges to a rational number in| |. Then 
the rational value of the infinite sum is the same in both metrics. 


For more “‘ paradoxes,” see Exercises 8-10. 


EXERCISES 


1. Find the exact disc of convergence (specifying whether open or closed) of the 
following series. In (v) and (vi), log, means the old-fashioned log to base p, 
and in (vii) ¢ is a primitive pth root of 1. [ ] means the greatest integer 
function. 


(i) Dnt xX" (iii) 2 p"X™ (vy) X phox" = (vii) TCE — 1IYX%Jn! 
Gi) S pmemA" Civ) E prXP™ (vi) E phn 

2. Prove that, if > a, and > b, converge to a and 5b, respectively (where a,, b,, a, 
be Q), then > cn, where cr = Dio aibn-1, converges to ab. 


3. Prove that 1 + XZ,[[X]] is a group with respect to multiplication. Let D be 
an open or a closed disc in 2 of some radius about 0. Prove that {fe 1 + 
XQ[LX]] | fconverges on D} is closed under multiplication, but is not a group. 
Prove that for fixed A, the set of f(X) = 1 + 3/21 a,X' such that ord, a, — Ai 
is greater than 0 for alli = 1,2, ... and approaches 00 as i —> 00, is a multipli- 
cative group. Next, let fel + XZ,[[X]],/ = 1, 2,3, .... Let f(X) = TTR, 
f,(X). Check that f(X) ¢€1 + XZ,[LX]]. Suppose that all of the /; converge 
in the closed unit disc D(1). Does f(X) converge in D(1) (proof or counter- 
example)? If all of the nonconstant coefficients of all of the f; are divisible by 
p, does that change your answer (proof or counterexample)? 


4. Let {a,} < Q be a sequence with |a,|, bounded. Prove that 
~ n! 
Px x(x + 1)(x + 2)---(e +n) 


converges for all x € Q not in Z,. What can you say if x €Z,? 


5. Let i be a square root of —1 in Q, (actually, i lies in Q, itself unless p = 3 
mod 4). Prove that: exp,(ix) = cos, x + isin, x for x€ D(p~¥@?-»-), 

6. Show that 2?~' = 1 (mod p’) if and only if p divides )?7{ (—1)//j (of course, 
meaning that p divides the numerator of this fraction). 

7. Show that the 2-adic ordinal of the rational number 

2 + 22/2 + 23/3 + 24/4 + 25/5 +--+ + 2"/n 

approaches infinity as n increases. Get a good estimate for this 2-adic ordinal 
in terms of n. Can you think of an entirely elementary proof (i.e., without 


using p-adic analysis) of this fact, which is actually completely elementary 
in its statement? 
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8. 


10. 


11. 


12. 
13. 


14. 
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Find the fallacy in the following too-good-to-be-true proof of the irrationality 
of zr. Suppose 7 = a/b. Let p # 2 bea prime not dividing a. Then 


0 = sin(pb7) = sin(pa) = s (— 1)"(pa)?"*1/(2n + 1)! = pa (mod p?), 
n=0 


which is absurd. 


Find the fallacy in the following proof of the transcendence of e. Suppose e 
were algebraic. Then e — 1 would also be algebraic. Choose a prime p # 2 
which does not divide either the numerator or denominator of any coefficient 
of the monic irreducible polynomials satisfied by e and by e — 1 over Q. 
You can show as an exercise that this implies that jel, = Je — 1|, = 1. We 
have: 1 = je — 1|,? = |(e — 1)?|p = |e? — 1 — >Pz2@)(—e)'|,. Since the 
binomial coefficients in the summation are all divisible by p, and since 
|-el|, = 1, it follows that 1 = |e? — 1|, = |>v=ip"/n!|,, which is impossible 
since each summand has | |, < 1. 


(a) Show that the binomial series for (1 — p/(p + 1))~" (where x is a positive 
rational integer) and for (1 + (p? + 2mp)/m?)'? (where m is a rational 
integer with m > (V2 + 1)p, ptm) converge to the same rational number as 
real and as p-adic infinite sums. 

(6) Let p => 7, n = (p — 1)/2. Show that (1 + p/n?)'/? gives a counter-example 
to Non-theorem 1. 


Suppose that a € Q is such that 1 + « is the square of a nonzero rational number a/b 
(written in lowest terms, with a and b positive). Let S be the set of all primes p for 
which the binomial series for (1 + «)'/? converges in | |,. Thus, pe S implies that 
(1 + «)'/? converges to either a/b or —a/b in | |,. We also include the “infinite 
prime” in S if the binomial series converges in | |,, =| |, ie. if «e€(—1, 1). Prove 
that: 


(a) For pan odd prime, p € S ifand only if p|a + bor p|a — b,inwhichcase(1 + a)!” 
converges to —a/b when p|a + b and to a/b when p|a — b. 

(b) 2€S if and only if both a and b are odd, in which case (1 + «)'/? converges to 
a/b when a = b (mod 4) and to —a/b when a = —b (mod 4). 

(c) 0 €S if and only if 0 < a/b < ge in which case (1 + «)1/? converges to a/b. 
(d) There is no « for which S is the empty set, and S consists of one element if « = 8, 
18,3, 2 and for no other «. 

(e) There is no « other than 8, 4€, 3, 3 for which (1 + «)!/? converges to the same value 
in| |, for all pe S. (This is one example of a very general theory of E. Bombieri.) 


Prove that for any nonnegative integer k, the p-adic number >7=9 np" is in Q. 


Prove that in Q3: 


© 322 


btd 320 = ; 3n 
Pay n42n 2 2, n4” 


n= 


Show that the disc of convergence of a power series f(X) = > a,X" is 
contained in the disc of convergence of its derivative power series f’(X) = 
> na,X"~}, Give an example where the regions of convergence are not the 
same. 


Exercises 


15. (a) Find an example of an infinite sum of nonzero rational numbers which con- 
verges in | |, for every p and which converges in the reals (i.e., in | | = | |). 
(b) Can such a sum ever converge to a rational number in any | |,or| |o? 


16. Suppose that, instead of dealing with power series, we decided to mimic the 
familiar definition of differentiable functions and say that a function f: Q > Q 
is ‘differentiable’ at ae Q if (f(x) — f(a))/(x — a) approaches a limit in Q 
as |x — al, 0. First of all, prove that, if f(X) = Dr=0an,X" is a power 
series, then it is differentiable at every point in its disc of convergence, and it 
can be differentiated term-by-term, i.e., its derivative at a point a in the disc of 
convergence is equal to >=, na,a"~1. In other words, the derivative function 
is the formal derivative power series. 


17. Using the definition of “differentiable” in the previous problem, give an 
example of a function f: Q—»Q which is everywhere differentiable, has 
derivative identically zero, but is not locally constant (see discussion of 
locally constant functions at the beginning of §II.3). This example can be 
made to vanish along with all of its derivatives at x = 0, but not be constant 
in any neighborhood of 0. Thus, it is in the spirit of the wonderful function 
e~'*? from real calculus, which does not equal its (identically zero) Taylor 
series at the origin. 


18. The Mean Value Theorem of ordinary calculus, applied to f:R—>R, 
f(x) = x? — x, on the interval {xe R | |x] < 1}, says that, since f(1) = 
S(—1) = 0 (here we are assuming that p > 2), we must have 


f(2) =0 forsome c«eR, |e] < 1. 


(In fact, « = +(1/p)'!~- works.) Does this hold with R replaced by Q and 
| | replaced by | |,? 


19. Let f: Q, > Q, be defined by x = > a,p"+> > g(an)p", where > a,p" is the 
p-adic expansion of x and g: {0, 1, ..., p — 1} Q, is any function. Prove 
that f is continuous. If g(a) = a? and p ¥ 2, prove that f is nowhere differentiable. 


20. Prove that for any N and for any / = 1,2,..., N, 
(+ Xe" — Le piZ[X] + XP" OZ, X). 


Suppose that a/b is a rational number with |a/b|, < 1, and you want to find 
the first M coefficients (M is a large number) of the power series (1 + X)?? 
to a certain p-adic accuracy. Discuss how to write a simple algorithm (e.g., 
a computer program) to do this. (Only do arithmetic in Z/p"Z, not in Q, 
since the former is generally much easier to do by computer.) 


21. If R is any ring, define the ring R[[X1, ..., X,]] (abbreviated R[[X]]) of 
formal power series in n variables as the set of all sequences {r;,....,:,} indexed 
by n-tuplesi,, ..., i, of nonnegative integers (such a sequence is thought of as 
> ViyesinXi't >> X's and sometimes abbreviated > r,X'), with addition and 
multiplication defined in the usual way. Thus, {ri,,.in} + {S.-i} = 
Miasevinds Where fying = Tieentg + Sizesigs and {Fie igh (Std = 
igvestgls Where fist, = Diy ig Skyy, With the summation taken over 
all pairs of a-tuples /,,...,/n and ky,...,k, for which fy + ky = hh, 
Jo + ke = a, ...5jn + Kn = ine 
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By the minimal total degree deg f of a nonzero power series f we mean the 
least d such that some r,.....;, with 4 + i2 +--- + i, = dis nonzero. We 
can define a topology, the “‘X-adic topology,” on R[[X]] by fixing some 
positive real number p < | and defining the ‘* X-adic norm” by 


fx & prees (|O|x is defined to be 0). 


(1) Show that | [x makes R[[ XJ] into a non-Archimedean metric space (see 
the first definition in §1.1; by ‘*non-Archimedean,”’ we mean, of course, that 
the third condition can be replaced by: d(x, y) < max(d(x, z), d(z, y))). Say 
in words what it means for |f|x to be <1. 

(2) Show that R[[X]] is complete with respect to | |x. 

(3) Show that an infinite product of series f,; € R[LX]] converges to a nonzero 
number if and only if none of the f; is zero and | f; — 1|, +0 (where 1 is the 
constant power series {r;,__;,} for which ro, = 1 and all other r;,;, = 0). 
We will use this in §2 to see that the horrible power series defined at the end of 
that section makes sense. 

(4) If fe R[[X]], define (QZ to be the same as f but with all coefficients 
Ny,¥4, With i; +---+ i, > d replaced by 0. Thus, fj is a polynomial in x 
variables. Let 91, ..., 2, € R[[X]]. Note that fa(gi(X), go(X), ..., &n(X)) is 
well-defined for every d, since it’s just a finite sum of products of power series. 
Prove that {f(g,(X), ..-, 8n(X))}a=0,1,2,..., is a Cauchy sequence in R[[X]] if 
lgjlxy < 1 forj = 1,...,n. In that case call its limit fo g. 

(5) Now let R be the field R of real numbers, and suppose that f; fa, 21, ..., 
8n are as in (4), with |g;|x < 1. Further suppose that for some « > 0 the 
series f and all of the series g; are absolutely convergent whenever we substi- 
tute X, = x, in the interval [—«, €]—< R. Prove that the series fo g is absolutely 
convergent whenever we substitute ; = x; in the (perhaps smaller) interval 
[—e’, e’] for some «’ > 0. 

(6) Under the conditions in (5), prove that if fo g(x, ..., Xn) has value 0 
for every choice of x1, ..., x, €[—e’, e’], then fo g is the zero power series 
in R([X]]. 

(7) As an example, let n = 3, write X, Y, Z instead of X1, X2, Xz, and let 


EE IIE NM ey 


MG YZ) = S (yxy t Vii — Zi), 
i=1 


Ei(X, Y, Z) = x, 
2X, Y,Z) = Y, 
gA(X,¥,Z)=X+ V+ XY. 


As another example, let ” = 2, 


elX, Y) = > Xi, 
g2(X, Y) = X. 


Explain how your result in (6) can be used to prove the basic facts about the 
elementary p-adic power series. (Construct the f and g; for one or two more 
cases.) 


2 The logarithm, gamma and Artin—Hasse exponential functions 


2. The logarithm, gamma and Artin—Hasse exponential 
functions 


In this section we look at some further examples of p-adic analytic functions 
(more precisely, “locally” analytic functions) which have proven useful in 
studying various questions in number theory. The first is Iwasawa’s extension 
of the logarithm. 

Recall that the Taylor series log, x = 5, (—1)"**(x — 1)"/n converges 
in the open unit disc around 1. The following proposition says that there is a 
unique function extending log, x to all nonzero x and having certain con- 
venient properties. 


Proposition. There exists a unique function log,:Q* —+ Q (where Q” = 
Q — {0}) such that: 


(1) log, x agrees with the earlier definition for |x — 1|, < 1, i.e., 
log, x = > (-1)"**(x — )"/n for |x — 1|, < 1; 
n=1 


(2) log, (xy) = log, x + log, y for all x, ye Q”; 

(3) log, p = 0. 
Proor. Recall from §III.4 that any nonzero x € 2 can be written in the form 
x = p’a(x,)<x,>, where p’ is some fixed root of the equation x’ — p* = 0, 
with r = a/b = ord, x, w(x,) is a root of unity, and |¢x,> — 1|, < 1. There 
is thus only one possible way to define log, x consistently with (1)-(3). 
Namely, (2) and (3) imply that log,(p’) = log,(w(x,)) = 0, and hence we 
must have 


log, x = > (-1 x) ~ 1)". 


We thus know that there is at most one definition of log, x which has 
the desired properties, namely, the definition log, x = log,<x, >. It remains 
to show that the three desired properties are actually satisfied. Properties 
(1) and (3) are obvious from the definition. 

In the course of our definition of log, x, we made a rather arbitrary choice 
of a bth root of p*. But if we had chosen another bth root of p* for our p’, this 
would have altered x, by a bth root of unity and hence would have altered 
c(x,) and <x,> by certain roots of unity. Notice that the new <x> would 
have to differ from the old <x,> by a pth power root of unity, because € = 
<x >/<x,> is in the open unit disc about 1 (see Exercise 7 in §III.4). In any 
case, the definition log, x = log,<x,)> would not be affected by this replace- 
ment of x, by x, because log, ¢ = 0, as remarked in §1. Thus, our definition 
really does not depend on the choice of p’. 

We now prove property (2). Let x = p’a(x1)(x1>, y = PPO1)K1), 
z= xy = p’**a@(z,)<z,>. Now p’** is not necessarily the same fractional 


87 


IV __ p-adic power series 


power of p as p’p*; it may differ by a root of unity. But the definition of log, z 
does not change if we change our choice of p’** to p’p*. In that case, z, = 
z/p’p” = X11, and so ¢z;> = <x,><y1), and 


log, z = log,<z;> = log,<x,> + log,<y1> = log, x + log, y, 


where the middle equality was proved in the last section in our discussion of 
the power series > (— 1)"*!x"/n. This completes the proof of the proposition. 


O 


Now let x9 # 0 be a fixed point of Q. Let r = |x9|,, and suppose that 
x is in the largest disc about x9 which does not contain zero, i.e., D,,(r" ). 
Then |x/x9 — 1|, < 1, and so 


log, x = log,(xo(1 + x/xo — 1)) = log, xo + > (—1)"* "(x — xo)"/nx$. 
n=1 


Thus, in D,,(r~) the function log, x can be represented by a convergent power 
series in x — X9. Whenever a function can be represented by a convergent 
power series in a neighborhood of any point in its region of definition, we 
say that it is locally analytic. Thus, log, x is a locally analytic function on 
Q — {0}. 

Recall from Exericse 16 of §1 that the usual definition of the derivative 
can be applied to p-adic functions, and that power series are always differ- 
entiable in their region of convergence, with the derivative obtained by 
term-by-term differentiation. In particular, applying this to log, x in D,,(r_), 
we obtain 


d oO 
sy log, x = > (— UM — x0)" x5 
x n=1 


= x5! > (1 = x/xoy" 


= X69 'M(x/xo) = 1/x 


for xe D,,(r~). We conclude: 
Proposition. log, x is locally analytic on Q — {0} with derivative 1/x. 


The next function we discuss is the p-adic analogue of the gamma-function. 

The classical gamma-function is a function from R to R which “inter- 
polates” n! (actually, '(s) is defined for complex s, but we aren’t interested in 
that here). More precisely, it is a continuous function of a real variable s 
excluding s = 0, —1, —2, —3,... (where it has “poles”) which satisfies 


Ts+1)=s! for s=0,1,2,3,.... 
Since the positive integers are not dense in R, there are infinitely many 
functions which satisfy this equality; but there is only one which has certain 
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other convenient properties. This gamma-function can be defined for s > 0 
by: 


T(s) = J ee xe 
0 

Thus, the gamma-function is the “ Mellin transform” of e~ * (see §7 of Chapter 
II). It is not hard to check (see Exercises 6-7 below) that this improper integral 
converges for s > 0, and that the function I(s) defined in this way satisfies 
I(s + 1) = sI(s) for all s > 0. In addition, T(1) = f° e~* dx = 1; then 
T(s + 1) = sI(s) = x(s — IT (s — 1) = --- = s!VC1) = !, so this function 
really is an interpolation of the factorial function. 

We would now like to do something similar p-adically, i.e., find a con- 
tinuous function from Z, to Z, whose values at positive integers s + 1 
coincide with s!. 

We shall assume that p > 2 in what follows; minor modifications are 
needed if p = 2. 

Recall from §2 of Chapter II under what conditions a function f(s) on 
the positive integers can be interpolated to all of Z,. Such a continuous 
interpolation exists if and only if for every « > 0 there exists N such that 


s =s'(mod p%) implies | f(s) —f(s’)|, < & (*) 
In that case the interpolating function is unique and is defined by 


f(s)= lim f(k). 
k>s,keN 

Unfortunately, the basic condition (*) does not hold for f(s) = (s — 1)!, 
since, for example, | f(1) — f(1 + p”)|, = 1 for any N > 0, since p divides 
s! whenever s > p. The problem is that, whenever s is a large integer in the 
old-fashioned archimedean sense, s! is divisible by a large power of p, ie., 
f(s) > 0 p-adically as s > o. 

We could modify the factorial function in a way analogous to how we 
modified the Riemann zeta-function in Chapter II (“removing the Euler 
factor”) by discarding indices divisible by p. That is, we could try to inter- 
polate the f(s) defined by: 

: s! 
for d= Th i= ppppipmm 
However, once again we have problems (see Exercise 8 below). But if we 
modify f(s) one final time by a mere change in sign for odd s, we can then 
interpolate. 


Proposition. Lez 
Tk) =(-1* TT i k=1,2,3,.... 


j<k, prj 


(When k = 1, the empty product is defined to be |, i.e., T,(1) = —1.) 
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Then 1, extends uniquely to a continuous function T,: Z, > Z, defined by 


T,(s)= lim (-1* [] jf. 


k>s,keN i<ki,pyj 
Proor. It suffices to prove (*); in fact we shall prove that 
ki’ =k+k,p™ implies I,(k) =T,,(k’) (mod p”). 
Notice that ,(k)€ Z, (that is why I, will be a map from Z, to Z; as soon 
as we show that the continuous interpolation exists). Hence the right side 
of the above implication is equivalent to the congruence 
1=TkyT(k) =(-1** [] 7 (mod p%). 
k<j<k’,pyj 
If we prove this for k, = 1, i.e., fork’ = k + p’, then by multiplying together 
the congruences with k replaced by k + ip’ (i=0,...,k, — 1) 
l= (- frst Gee = eres) Il j (mod p’), 
k+ipN <j<kt+(it 1)p%, pt j 
we immediately obtain the desired congruence. Since p is odd, we have 
(— 1)?" = —1, and so we have reduced the proof to showing that 
j = —1(mod p). 
kSj<kt+pN, py j 
Since the product runs through every congruence class in (Z/p%Z)* exactly 
once, we have 
j= [1 jf (modp%). 
k<j<k+pN,pyj O<j<P,pyi 
Thus, it remains to prove that the product on the right is = —1 (mod p’). We 
now pair off elements j and j’ which satisfy jj’ = 1 (mod p’). For each j 


there is precisely one such j’. Since p > 2, there are only two values of j for 
which j’ = j, i.e., for which j* = 1 (mod p¥) (see Exercise 9 below). Thus, 
TT = =(sa(-1) = -1 (mod p®, 
O<j<pN, prj 


as desired. O 


The key step in the proof, the congruence for [| ,<,»,,, jj, iS a generaliza- 
tion of Wilson’s theorem, which is the case N = 1: (p — 1)! = —1 (mod p). 


Basic properties of I,,. 


T,(s + 1) = if seZx; 


ee 1 
T,(s) —1 if sepZ,. () 


Proor. Since both sides are continuous functions from Z, to Z,>, it suffices 
to check equality on the dense subset N, i.e., when s = ke N. But then it 
follows immediately from the definition of I’,(k). O 
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(2) IfseZ,, write s = so + ps;, where sy {1, 2,..., p} is the first digit in s 
unless s € pZ,, in which case sy = p rather than 0. Then 


Ts), — s) = (-)*. 


Proor. Again by continuity it suffices to check this when s = k. For s = 1 the 
equality holds because I',(1) = —1 by definition, and (0) = —I,(1) = 1 
by property (1). Now use induction, assuming the equality for s = k and then 
proving it for k + 1. Using property (1), we have 


r(s+ DF,4 —G+))_ [-s(-(-9) = -1 if seZj; 
T,()I',1 — ) a et en eee if sepZ,, 


and this shows that the equality in (2) for s + 1 follows from the equality for s. 


(3) For se Z,, define sg and s, as in property (2). Let m be any positive 
integer not divisible by p. Then 


Tio Ts + h)/m) 
T,(s) [1 fz 1 ,(h/m) 


=m? ~ $007 (P~ 1))s1, 


Remarks. 1. The expression on the right makes sense, because the number 
being raised to the p-adic power, namely m~~"), is congruent to 1 mod p. 
(See §2 of Chapter II.) Of course, sg is a positive integer, so m'~*° makes 
sense. 

2. The classical gamma-function can be shown to satisfy the “Gauss— 
Legendre multiplication formula” 


Tgsd Gs + Wim) _ a, 
Ms) TTR P(h/m) ' 


PROOF OF (3). Let f(s) be the left side and let g(s) be the right side of the 
equation. Both f and g are continuous, so it suffices to check equality for 
s =keN. Fors = k = 1 both sides are clearly 1. We proceed by induction 
on k. We have 


f(st+1)_T,(s)F,((s/m) +1) fi/m if seZ>; 

f(s) «(s+ DF, (s/m) 1 if’ sepZ,. 
On the other hand, ifs € Z?, we have g(s + 1)/g(s) = 1/m, since then (s + 1)o 
= So + land (s + 1), = s,, while if s € pZ, we have g(s + 1)/g(s) = 1, since 
then (s + 1)y = So — (p — 1) and (s + 1), = s, + 1. Hence f(s + 1)/f(s) = 
g(s + 1)/g(s), and the induction step follows. {ral 


This concludes our discussion of the p-adic gamma-function. 
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We now introduce an ‘“‘elementary function” which is “better” than 
exp,—has a larger disc of convergence—and which can often be used to play 
a similar role to exp in situations when better convergence than D(p~1/--) 
is needed. To do this, we first give an infinite product formula for the ordinary 
exponential function, in terms of the ‘“‘MObius function” p(n), which is often 
used in number theory. For ne {1, 2, 3, ...} we define 


(i= te if n is divisible by a perfect square greater than 1; 
ae (—1)*, ifm is a product of & distinct prime factors. 


Thus, 1 = 2(1) = 2(6) = #(221) = (1155), 0 = 49) = 4(98), —1 = w(2) = 
(97) = 2(30) = u(105). A basic fact about p» is that the sum of the values 
of « over the divisors of a positive integer 1 equals | ifn = 1 and 0 otherwise. 
This is true because, if 1 = p,%:---p,’s is the decomposition into prime 
factors, and if s > 1, then we have: 


>, u(d) = > w(pret- ++ ps) = > (—1)24 = (1 — 1)° = 0. 
In 


all possible 
€;=Oorl,i=1,---,s 


We now claim that the following ‘“‘formal identity” holds in Q[[X]]: 


exp(X) = [T= x0" ST] Be wen, of X°9. 
(Note that this infinite product of infinite series makes sense, since the nth 
series starts with 1 + u(n)/nX", i.e., has no powers of X less than the nth; 
thus, only finitely many series have to be multiplied together to determine 
the coefficient of any given power of X.) To prove this, take the log of the 
right hand side. You get: 


log] 1 = XP) - ain or S #0) 1501 = XxX") as > u(n) ) x 


2 Duo] G = old my, 


[7 


ll 
mMs 
a 


gathering together coefficients of the same power of X. By the basic property 
of » proved above, this equals ¥. Taking exp of both sides, we obtain the 
desired formal identity. 

(Several times we have used the principle, mentioned in the discussion of 
log, and developed in Exercise 21 of the last section, that manipulation of 
formal power series as though the variables were real numbers is justified 
as long as the series involved all converge in some interval about 0.) 

If we look at []7-, (1 — X*)~4™!" p-adically, we can focus in on where 
the “trouble”? comes in. By “trouble” I mean why it only converges on 
D(p~*?--) and not on D(1~). Namely, if p|n and n is square-free, then 
(1 — X¥")7#™/ only converges when an x is substituted for which 


|x"|, = |x|," D(r-), where r = | )" 


Be po ue-dnl.. 
> Pp 
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For example, if n = p, then this converges precisely when 


pea Ly. calla is 
|x| > < (|p p =P * 


But as long as p{n we’re O.K.: that is, since —p(n)/neZ,, we have 
(1 — X*)-4#" eZ, [[X]]. (Remember in all this that (1 — X*)* is just 
shorthand for B,,(—X" = So a(a — 1)---(@-—i + Ili} (—X")) 

So let’s define a new function £,, which we call the “‘Artin—Hasse ex- 
ponential,”’ by just forgetting about the “bad” terms in the infinite product 
(this is very similar to our “‘removing the Euler factor” in order to define 
the p-adic zeta-function in Chapter II): 


E(X) = [ ]G — x) e OLX]. 
n=1 
pin 
Since each infinite series B_jcnyn,p(— X”) is in | + X"Z,[[X]], their infinite 
product makes sense (only finitely many have to be multiplied to get the 
coefficient of any given power of X), and it lies in 1 + XZ,[[X]]. 
We can easily find a simpler expression for E,(X), using the property of 
the uw function: 
_ fil ifisa power of p; 
ana Ha) = ‘5 otherwise. 
This property follows immediately from the earlier property of », applied to 
n| p's” in place of n. Considering £,(X) over R (or C) and taking the loga- 


rithm as before gives: 
log E,(X) = — > ee => |= >: H)| 
= j=1 nlj.ptn 


Hence, 


(eee, Cee oe 
EX) = exp(X + a + me + Pg + ): 
as an equality of formal power series in Q[[LX]]. 

The important thing about £,(X), in distinction from exp, (X), is that 
E,(X) € Z,[[X]]. Thus, E,(X) converges in D(1-). It can be seen (Exercise 11 
of §I1V.4) that this is its exact disc of convergence, i.e., it does not converge 
on D(i). 

We conclude this section with a useful general lemma, due to Dwork. 


Lemma 3. Let F(X) = > a,X'e1 + XQ,[[X]]. Then F(X)e1 + XZ, (XII 
if and only if F(X”)(F(X))” € 1 + pXZ,[[X]).- 


Proor. If F(X) ¢1 + XZ,[[X]], then, since (a + 6)? = a? + 6? (mod p) and 
a’ = a(mod p) for ae Z,, it follows that 


(F(X)? = F(X*) + pG(X) for some G(X) € XZ,[[X]]. 
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Hence 


F(X?) _ , _ pG(Xx) 
Dy = | ecepe ©! + PX SolL*T 


because (F(X))? € 1 + XZ,[[X]] and hence can be inverted (see Exercise 3 
of §1). 
In the other direction, write 


F(X?) = (F(X)/PG(X), = G(X) e 1 + pXZ,[[X]], 
GX) => bX, F(X) = > aX". 


We prove by induction that a, ¢Z,. By assumption, a) = |. Suppose a; € Z, 
for i < n. Then, equating coefficients of X" on both sides gives 


Gy, if pdividesn| _ : ai. {x i\? x i 
0 atherwise = coefficient of X" in 2, aX 1+ » b,X')- 


If we expand the polynomial on the right, subtract a,;p in the case p|n (and 
recall that ay), = a%,, mod p), and notice that the resulting expression 
consists of pa, added to a bunch of terms in pZ,, we can conclude that 
PGy € PL, 1.€., Ay, EZ. O 


Dwork’s lemma can be used to give an easy direct proof (without using 
the infinite product expansion) that the formal power series £,(X) = 
eX +X? Ip) + (Xp) + hag coefficients in Z, (see Exercise 17 below). 

Dwork’s lemma, which seems a little bizarre at first glance, is actually an 
example of a deep phenomenon in p-adic analysis. It says that if we know 
something about F(X")/(F(X))?, then we know something about F. This 
quotient expression F(X”)/(F(X))? measures how much difference there is 
between raising X to the pth power and then applying F, versus applying F 
and then raising to the pth power, i.e., it measures how far off F is from 
commuting with the pth power map. The pth power map plays a crucial 
role, as we’ve seen in other p-adic contexts (recall the section on finite fields). 
So Dwork’s lemma says that if F ““commutes to within mod p”’ with the pth 
power map, i.e., F(X”)/(F(X))? = 1 + p-> (p-adic integers) X', then F has 
p-adic integer coefficients. 

We apply this lemma to a function that will come up in Dwork’s proof of 
the rationality of the zeta-function. First, note that Lemma 3 can be general- 
ized as follows: Let F(X, Y) = > an,,X" Y™ be a power series in two variables 
X and Y with constant term 1, ie., 


F(X, Yye1 + XO,[LX%, Y]] + YQ,[LX, Y]]. 
Then all the a,,,,’s are in Z, if and only if 
F(X?, Y)(F(X, Y))?e1 + pXZ,[[X, Y]] + pYZ,ILX, Y]). 


The proof is completely analogous to the proof of Lemma 3. 
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We now define a series F(X, Y) in Q[[X, Y]] as follows: 
F(X, Y) = Bx oY) Bex? — xyp,o( Y?) Boxe? — x92, 0( VY"): +» Bexe™_ x9" yp, 0 VY"): * 
(1 + YL + YryrP- eq, + YP?)(xP? — xPy/p2 oie 
x (1+ yeryort=x9" aon 
= > AG) wie Dy 


ll 


~ 
i=) 


— 


i) ( ao xXx” part > Ce (= = > Cae 1) 


n=1 \i=0 p” p” 
pr ypn- tp” 
(— & -i+1) = ). 
P i! 


Since we only have to take finitely many terms in the product to get the co- 
efficient of any X"Y™, this is a well-defined infinite series F(X, Y) = 
dannX*Y™in 1 + XQ,[[X, Y]] + YQ,[[X, Y]]. We use the generalization 
of Lemma 3 to prove that a,,,, € Zp. Namely ,we have 
F(X®, Y*") (1 + Y°)77(1 + ¥ 2287? - XP] 4 VP*yaP?- KPA Ip? 
OX, YY) XL EYAL EVP 
a Uhest F2F 
~ (+ YX 
We must show that (1 + Y?)*/(1 + Y)?* is in 1 + pXZ,[[X, Y]] + 
PYZ,[[X, Y]]. Applying Lemma 3 in the other direction shows that, since 
1+ Yel + YZ,[[Y]], it follows that 


(1 + YY) + Y)? = 14 pYG(Y), G(Y)€Z,[[Y]]. 
Thus, 
ype = + pyacny = 5 2 


which is clearly in 1 + pXZ,[[X, Y]] + pYZ,[[X. Y]]. We conclude that 
F(X, Y)eZ,{[X, Y]1.- 


Da Et) racny, 


EXERCISES 

. Find log; 42 mod 7* and log, 15 mod 2!?. 

. Prove that the image of Z, under log, is pZ, for p > 2 and is 4Z, for p = 2. 

. For p > 2 and ae Z;, prove that p? divides log, a if and only if a?~! = 1 mod p’. 


. Find the derivative of the locally analytic function x log, x — x. 


aA & Ww NH = 


. Suppose that a function f: Q* — Q satisfies properties (1) and (2) of the proposition 
at the beginning of this section. Prove that f(x) must be of the form f(x) = log, x 
+ cord, x for some constant c EQ. 
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6. 
7. 


10. 


11. 


12. 


13. 


96 


Verify that the improper integral {> x*~ 'e~* dx converges if and only if s > 0. 


Using integration by parts, prove that '(s + 1) = sI(s), where I's) is defined by the 
integral in Exercise 6. 


. Show that the function f(s) = [J ;<s, py; J for se N does not extend to a continuous 


function on Z,. 


. For p > 2, show that if j? — 1 = 0 mod p®, then j = +1 mod p™. What happens if 


p=2? 


Show that I’,(1/2)? = —(>}), where (=) = 1 if x? = —1 has a solution in F, and 
it equals — 1 otherwise. 


Compute I’,(1/4) and I’,(1/3) to four digits (if you don’t have a computer or pro- 
grammable calculator handy, then compute them to two digits). 


Let ./ —1 eZ, denote the root with first digit 3, and let ./—3 € Z, be the root with 
first digit 2. Use Exercise 9 of §1.5 and Exercise 11 above to verify the following 
equalities to 4 digits: 


T,(1/4)? = —2 + /-1;1,(1/3)? = (1 — 3./—3)/2. 


Note: These equalities are known to be true, but no down-to-earth proof 
(without p-adic cohomology) is known for them. They are special cases of a more 
general situation. To explain this, let us take, for example, the second equality. Then 
for p = 7 we let € = e?*”7 € C be a primitive pth root of unity, and let 


o =(-1+4./—3)2 = ei3 


be a nontrivial (p — 1)th root of unity. Next take a generator of the multiplicative 
group F,° (see Exercise 2 of §III.1); in our case p = 7 let us take 3. Then 


6 
8s = wit 

is known as a Gauss sum. It is not hard to verify that the right side of the second 
equality above is equal to g3/7. More generally, one can prove that, whenever a/d 
is a rational number whose denominator divides p — 1, the p-adic number I’ ,(a/d)* 
is an element of the field Q(w), where w is a primitive dth root of unity. (Exercise 10 
gives another special case of this, in which a/d = 1/2,@ = —1.) Namely, it turns out 
that I’,(a/d)‘ can be expressed in terms of suitable Gauss sums. (For a treatment of 
this, see Lang, Cyclotomic Fields, Vol. 2, or else Koblitz, p-adic Analysis: a Short 
Course on Recent Work.) 

Note, by the way, that this shows a major difference between I’, and the classical 
T-function, since, for example, (1/3) is known to be transcendental. 


Let s = r/(p — 1) bea rational number in the interval (0, 1), and let m be a positive 
integer not divisible by p. Prove that 

[Tro U,((s + h)/m) 

r,(s) TT h= t D¢h/ ‘m) 
is equal to the Teichmiiller representative of m?~94~?) (i.e., to the (p — 1)th 


root of unity in Z, which is congruent mod p to m'~?*" = m’). (Recall that if , 
is replaced by the classical I'-function, then this expression equals m}~‘). 


3 Newton polygons for polynomials 


14. Prove that exp, X, (sin, X)/X, and cos, X have no zeros in their regions of 
convergence, and that £,(X) has no zeros in D(1~). 


15. Find the coefficients up through the X* term in E,(X) for p = 2, 3. 


16. Find the coefficients in E,(X) through the X®~+ term. Find the coefficient of 
X?, What fact from elementary number theory is reflected in the fact that the 
coefficient of X? lies in Z,? 


17. Use Dwork’s lemma to give another proof that the coefficients of E,(X) 
are in Zp. 


18. Use Dwork’s lemma to prove: Let f(X) = exp(>0 b,X"), b,€ Q». Then 
f(X)e1 +4 XZ,[[X]] if and only if 5:-. — phe pZ, for i= 0,1,2,... 
(where b-; a 0). 


3. Newton polygons for polynomials 


Let f(Y) = 1 + 37, 4a,X'e1 + XQLX] be a polynomial of degree # with 
coefficients in Q and constant term |. Consider the following sequence of 
points in the real coordinate plane: 


(0, 0), (1, ord, a@,), (2, ord, a2), ..., (é, ord, a), ..., (”, ord, ay). 


(If a; = 0, we omit that point, or we think of it as lying “infinitely” far 
above the horizontal axis.) The Newton polygon of f(X) is defined to be the 
“convex hull’ of this set of points, i.e., the highest convex polygonal line 
joining (0, 0) with (n, ord, a,) which passes on or below all of the points 
(i, ord, a,). Physically, this convex hull is constructed by taking a vertical 
line through (0, 0) and rotating it about (0, 0) counterclockwise until it hits 
any of the points (i, ord, a;), taking the segment joining (0, 0) to the last such 
point (i,, ord, a;,) that it hits as the first segment of the Newton polygon, 
then rotating the line further about (i, ord, a,,) until it hits a further point 
(i, ord, a,) (i > i,), taking the segment joining (i,, ord, a;,) to the last such 
point (i, ord, a) as the second segment, then rotating the line about 
(iz, ord, a;,) and so on, until you reach (n, ord, a,). 

As an example, Figure 1 shows the Newton polygon for f(X) = 1 + 
X27 4 4X5 + 3X* in Q;[X]. 

By the vertices of the Newton polygon we mean the points (i;, ord, a,,) 
where the slopes change. If a segment joins a point (i, m) to (i’, m’), its slope is 
(m' — m)/(i' — i); by the “length of the slope’? we mean i’ — i, i.e., the 
length of the projection of the corresponding segment onto the horizontal 
axis. 


Lemma 4. In the above notation, let f(X) = (1 — X/o,)---(1 — X/a,) be the 
factorization of f(X) in terms of its roots «,€ Q. Let A, = ord, 1/a,. Then, 
if X is a slope of the Newton polygon having length I, it follows that precisely | 
of the X, are equal to X. 
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Figure IV.1 


In other words, the slopes of the Newton polygon of f(X) “‘are” (counting 
multiplicity) the p-adic ordinals of the reciprocal roots of f(X). 


Proor. We may suppose the a, to be arranged so that A; < Ap < --- < Ay. Say 
Ay = Ag =--- =A, < Aga. We first claim that the first segment of the 
Newton polygon is the segment joining (0, 0) to (r, rA,). Recall that each a, is 
expressed in terms of I/a,, l/a2,..., 1/a, as (—1)‘ times the ith symmetric 
polynomial, i.e., the sum of all possible products of i of the 1/«’s. Since the 
p-adic ordinal of such a product is at least iA,, the same is true for a;. Thus, the 
point (i, ord, a,) is on or above the point (i, iA), i.e., on or above the line 
joining (0, 0) to (7, ra,). 

Now consider a,. Of the various products of r of the I/a’s, exactly one has 
p-adic ordinal rA,, namely, the product 1/(a,0,---a,). All of the other 
products have p-adic ordinal > rd,, since we must include at least one of the 
Artis Arta, ++ -> Aq. Thus, a, is a sum of something with ordinal rA, and 
something with ordinal >rA,, so, by the “isosceles triangle principle,” 
ord, a, = rj. 

Now suppose i > r. In the same way as before, we see that all of the 
products of i of the 1/«’s have p-adic ordinal >iA,. Hence, ord, a, > id,. If 
we now think of how the Newton polygon is constructed, we see that we 
have shown that its first segment is the line joining (0, 0) with (r, rA,). 

The proof that, if we have A, < Asa. = Asgg = +°* = Asay < Asargi, then 
the line joining (s, A, + Ag +--- + A,)to(s + 7, Ay + Ag +--+ + Ay HAG 41) 
is a segment of the Newton polygon, is completely analogous and will be 
left to the reader. O 


4. Newton polygons for power series 
Now let f(X) = 1+ 52, a;X'e1 + XO[[X]] be a power series. Define 


SAX) = 14+ D2, a,X'e1 + XOQ[X] to be the nth partial sum of f(X). 
In this section we suppose that f(X) is not a polynomial, i-e., infinitely many 
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Figure IV.2 


a; are nonzero. The Newton polygon of f (X) is defined to be the “limit” of the 
Newton polygons of the f,(X). More precisely, we follow the same recipe as in 
the construction of the Newton polygon of a polynomial: plot all of the points 
(0, 0), (1, ord, a,), ..., (i, ord, a), ...; rotate the vertical line through (0, 0) 
until it hits a point (i, ord, a;), then rotate it about the farthest such point it 
hits, and so on. But we must be careful to notice that three things can happen: 

(1) We get infinitely many segments of finite length. For example, take 
J(X) = 1 + S22, p’X', whose Newton polygon is a polygonal line inscribed 
in the right half of the parabola y = x? (see Figure 2). 

(2) At some point the line we’re rotating simultaneously hits points 
(i, ord, a;) which are arbitrarily far out. In that case, the Newton polygon 
has a finite number of segments, the last one being infinitely long. For example, 
the Newton polygon of f(X) = 1 + >, X‘ is simply one infinitely long 
horizontal segment. 

(3) At some point the line we’re rotating has not yet hit any of the (i, ord, a,) 
which are farther out, but, if we rotated it any farther at all, it would rotate 
past such points, i.e., it would pass above some of the (i, ord, a,). A simple 
example is f(X) = 1 + > 2, pX‘. In that case, when the line through (0, 0) 
has rotated to the horizontal position, it can rotate no farther without passing 
above some of the points (i, 1). When this happens, we let the last segment of 
the Newton polygon have slope equal to the least upper bound of all possible 
slopes for which it passes below all of the (i, ord, a,). In our example, the 
slope is 0, and the Newton polygon consists of one infinite horizontal segment 
(see Figure 3). 

A degenerate case of possibility (3) occurs when the vertical line through 
(0, 0) cannot be rotated at all without crossing above some points (i, ord, a;). 
For example, this is what happens with f(X) = > 2, X‘/p”. In that case, f(X) 
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Figure IV.3 


is easily seva to have zero radius of convergence, i.e., f(x) diverges for any 
nonzero x. In what follows we shall exclude that case from consideration and 
shall suppose that f(X) has a nontrivial disc of convergence. 

In the case of polynomials, the Newton polygon is useful because it 
allows us to see at a glance at what radii the reciprocal roots are located. We 
shall prove that the Newton polygon of a power series f(X) similarly tells 
us where the zeros of /(X) lie. But first, let’s make an ad hoc study of a 
particularly illustrative example. 

Let 

X2 x 


Xx 1 
IM Vt zt ateo tet =a zZlosl — x). 


The Newton polygon of f(X) (see Figure 4, in which p = 3) is the polygonal line 
joining the points (0,0), (p — 1, —1), (p? — 1, —2),...,(p’ — 1, —/),...3 
it is of type (1) in the list at the beginning of this section. If the power series 
analogue of Lemma 4 of §3 is to hold, we would expect from looking at this 
Newton polygon that f(X) has precisely p’*! — p’ roots of p-adic ordinal 
1/(p’** — p’). 

But what are the roots of —1/X log,(1 — X)? First, if x = 1 — ¢, where 
is a primitive p’*!th root of 1, we know by Exercise 7 of §III.4 that ord, x = 
1/(p’+? — p’); and we know by the discussion of log, in §IV.1 that log, 
(1 — x) = log, = 0. Since there are p’*? — p’ primitive p’**th roots of 1, 
this gives us all of the predicted roots. Are there any other zeros of f(X) in 
D(-)? 

Let x € D(I-) be such a root. Then for any j, x; = 1 — (1 — x)” € D(I-) 
is also a root since log,(1 — x;) = p’ log,(1 — x) = 0. But for 7 sufficiently 
large, we have x; € D(p~/®-»-). For x; € D(p~ ~~), we have | — x; = 
exp,(log,(1 — x;)) = exp, 0 = 1. Hence (1 — x)” = 1, and x must be one 


Figure IV.4 


100 


4 Newton polygons for power series 


of the roots we already considered. Thus, the appearance of the Newton 
polygon agrees with our knowledge of all of the roots of log,(1 — X). 

We now proceed to prove that the Newton polygon plays the same role for 
power series as for polynomials. But first we prove a much simpler result: 
that the radius of convergence of a power series can be seen at a glance from 
its Newton polygon. 


Lemma 5. Let b be the least upper bound of all slopes of the Newton polygon of 
A(X) = 14+ SRiaXtel + XQ([[X]]. Then the radius of convergence is 
p? (6 may be infinite, in which case f(X) converges on all of Q). 


Proor. First let |x|, < p®, ie., ord, x > —b. Say ord, x = —b’, where 
b’ < b. Then ord,(a,x‘) = ord, a; — ib’. But it is clear (see Figure 5) that, 
sufficiently far out, the (i, ord, a;) lie arbitrarily far above (i, 6’/), in other 
words, ord,(a,x') > 00, and f(X) converges at X = x. 


Figure IV.5 


Now let |x|, > p®, ie., ord, x = —b’ < —b. Then we find in the same 
way that ord,(a,x') = ord, a, — b’iis negative for infinitely many values of i. 
Thus f(x) does not converge. We conclude that f(X) has radius of conver- 
gence exactly p’. O 


Remark. This lemma says nothing about convergence or divergence 
when |x|, = p®. It is easy to see that convergence at the radius of convergence 
(‘on the circumference”’) can only occur in type (3) in the list at the beginning 
of this section, and then if and only if the distance that (i, ord, a;,) lies above 
the last (infinite) segment approaches oo as i->0o. An example of this 
behavior is the power series f(X) = 1 + >2%, p'X”', whose Newton polygon 
is the horizontal line extending from (0,0). This f(X) converges when 
ord, x = 0. 

One final remark should be made before beginning the proof of the power 
series analogue of Lemma 4. If ce Q, ord, c = A, and g(X) = f(X/c), then 
the Newton polygon for g is obtained from that for f by subtracting the line 
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y = Ax—the line through (0, 0) with slope A—from the Newton polygon for f- 
This is because, if f(X¥) = 1+ Sa,X' and g(X) = 1+ 5.6,X', then we 
have ord, b; = ord,(a,/c') = ord, a; — Xi. 


Lemma 6. Suppose that A, is the first slope of the Newton polygon of f(X) = 
1+ S2,4aX'el + XO[LX]]. Let ce Q, ord, c =A < Ay. Suppose that 
JS(X) converges on the closed disc D(p*) (by Lemma 5, this automatically 
holds ifX < 2, or if the Newton polygon of f{(X) has more than one segment). 
Let 


a(X) = (1 -—ecX)f(xX)el + XQ(LX]]. 


Then the Newton polygon of g(X) is obtained by joining (0, 0) to (1, A) and 
then translating the Newton polygon of f(X) by | to the right and X upward. 
In other words, the Newton polygon of g(X) is obtained by “joining” the 
Newton polygon of the polynomial (1 — cX) to the Newton polygon of the 
power series f(X). In addition, if f(X) has last slope 4, and converges on 
D(p“‘), then g(X) also converges on D(p*‘). Conversely, if g(X) converges on 
D(p*5), then so does f(X). 


PRoorF. We first reduce to the special case c = I, A = 0. Suppose the lemma 
holds for that case, and we have f(X) and g(X) as in the lemma. Then /,(X) = 
S(X/c) and g,(X) = (I — X)f,(X) satisfy the conditions of the lemma with 
c, A, A, replaced by 1, 0,A, — A, respectively (see the remark immediately 
preceding the statement of the lemma). Then the lemma, which we’re assum- 
ing holds for f,; and gj, gives us the shape of the Newton polygon of g,(X) (and 
the convergence of g, on D(p*s~*) when f converges on D(p’s)). Since g(X) = 
g.i(cX), if we again use the remark before the statement of the lemma, we 
obtain the desired information about the Newton polygon of g(X). (See 
Figure 6.) 

Thus, it suffices to prove Lemma 6 with c = 1,A = 0. Let g(X) =1 + 
>21 5X‘. Then, since g(X) = (1 — X)f(X), we have 64, = a4, — @ for 
i > 0 (with a, = 1), and so 


ord, b;,, = min(ord, a,,,1, ord, a), 


with equality holding if ord, a,,, # ord, a; (by the isosceles triangle principle). 
Since both (i, ord, a;) and (i, ord, a;,1) lie on or above the Newton polygon 
of f(X), so does (/, ord, 6,1). If (i, ord, a;) is a vertex, then ord, 4,41 > 
ord, @,, and so ord, 6;,,; = ord, a,;. This implies that the Newton polygon of 
g(X) must have the shape described in the lemma as far as the last vertex of 
the Newton polygon of f(X). It remains to show that, in the case when the 
Newton polygon of f(X) has a final infinite slope A,, g(X) also does; and, if 
J(X) converges on D(p*‘), then so does g(X) (and conversely). Since ord, 5; 41 
> min(ord, a;+ 1, ord, a;), it immediately follows that g(X) converges wher- 
ever f(X) does. We must rule out the possibility that the Newton polygon of 
g(X) has a slope A, which is greater than A,. If the Newton polygon of g(X) 
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fi &1 


Figure IV.6 


did have such a slope, then for some large i, the point (i + 1, ord, a;) would 
lie below the Newton polygon of g(X). Then we would have ord, b; > 
ord,a; for all 7 >i+ 1. This first of all implies that ord, a;,, = ord, q;, 
because a;,, = b;,, + a;; then in the same way ord, a;,. = ord, a;41, and 
so on: ord, a; = ord, a; for all j > i. But this contradicts the assumed con- 
vergence of f(X) on D(1). The converse assertion (convergence of g implies 
convergence of f) is proved in the same way. 


Lemma 7. Let f(X) = 1+ 314,X'e1 + XQ[LX]] have Newton polygon 
with first slope d,. Suppose that f{(X) converges on the closed disc D(p*1), 
and also suppose that the line through (0, 0) with slope 4, actually passes 
through a point (i, ord, a;). (Both of these conditions automatically hold if 
the Newton polygon has more than one slope.) Then there exists an x for 
which ord, x = —A, and f(x) = 0. 


Proor. For simplicity, we first consider the case A; = 0, and then reduce the 
general case to this one. In particular, ord, a, = 0 for all i and ord, a, > 0 
as i-> 00. Let N > 1 be the greatest i for which ord, a, = 0. (Except in the 
case when the Newton polygon of /(X) is only one infinite horizontal line, 
this N is the length of the first segment, of slope A, = 0.) Let f,(X) = 
1 + St, a,X'. By Lemma 4, forn > N the polynomial f,(X) has precisely NV 
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roots Xn,1, +++, Xn,v With ord, xX,,; = 0. Let xy = xy,1, and for n > N let 
Xn+1 be any of the Xn41,15 -- +3 Xn¢a,w With |x,41,; — X,{p minimal. We claim 
that {x,} is Cauchy, and that its limit x has the desired properties. 

For n = N let S, denote the set of roots of f,(X) (counted with their 
multiplicities). Then for n => N we have 


|fn+1(%n) —fil%n)|p = |fn+1(%n)|p (since f,(x,) = 0) 


Xn 


x 


XESy +1 Pp 


a Il |1 — xn/Xn+i4lp (sinceif x € S,,, has ord, x < 0, 
i we then have {1 — x,/x|, = 1) 
=[[leasie ale (ince [ap ssule = 1) 
> [Xn+1 — Xalp”s 
by the choice of x,.,. Thus, 
[Xn+1 — XnlpY < |fa+iQn) — Sa%n)lp = lane int ly = [@nsale- 


Since |a,.:|, > 0 as n> ©0, it follows that {x,} is Cauchy. 
If x, > x € Q, we further have f(x) = lim, f,(x), while 


n xi x! 
3 ota 
f=1  ¥ — Xn 
since |a;|p < l and |(x' — x_)/(x — Xn)|p = [xtot + xt- 2x, + xf 3xy? + --- 
+ xi "|, < 1. Hence, f(x) = lim,.. f,(x) = 0. This proves the lemma when 
Ay = 0. 

Now the general case follows easily. Let 7 € Q be any number such that 
ord, 7 = 4,. Note that such a a exists, for example, take an ith root of an 
a; for which (i, ord, a;) lies on the line through (0, 0) with slope A,. Now let 
g(X) = f(X/z). Then g(X) satisfies the conditions of the lemma with A, = 0. 


So, by what’s already been proved, there exists an Xx) with ord, xX) = 0 and 
g(Xo) = 0. Let x = xo/7. Then ord, x = —A, and f(x) = f(%o/7) = g(x) = 0. 
Oo 


< |x -— xl, 
iP 


CV» = If) — AriQle = 1x — Xnlp 


Lemma 8. Let f(X) = 1 + 314X'e1 + XQ[[X]] converge and have 
value 0 at a. Let g(X) = 1 + 3721 5,X' be obtained by dividing f(X) by 
1 — X/a, or equivalently, by multiplying f(X) by the series 1 + X/« + 
X7Ja? +--+. + Xa! +--+. Then g(X) converges on D(\e\,). 

Proor. Let f,(X) = 1 + >?., a,X'. Clearly, 


b, = 1/a! + a,/ot-3 + az[ot-? test + A -y/e + a, 
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so that 
bot = f(a). 
Hence |b,«'|, = | fi(«)|p > 0 as i 00, because f(a) = 0. oO 


Theorem 14 (p-adic Weierstrass Preparation Theorem). Let f(X) = 1+ 

Dre. aX'el + XQ[LX]] converge on D(p*). Let N be the total horizontal 
length of all segments of the Newton polygon having slope < Xif this horizontal 
length is finite (i.e., if the Newton polygon of f{(X) does not have an infinitely 
long last segment of slope A). If, on the other hand, the Newton polygon of 
S(X) has last slope 2, let N be the greatest i such that (i, ord, a,) lies on that 
last segment (there must be a greatest such i, because f{(X) converges on 
D(p?)). Then there exists a polynomial h(X)¢€1 + XQ[X] of degree N and 
a power series g(X) = 1 + 3/72, 5,X' which converges and is nonzero on 
D(p?), such that 


W(X) = f(X)-8(X). 


The polynomial h(X) is uniquely determined by these properties, and its 
Newton polygon coincides with the Newton polygon of f(X) out to (N, 
ord, ay). 


Proor. We use induction on N. First suppose N = 0. Then we must show 
that g(X), the inverse power series of f(X), converges and is nonzero on 
D(p?). This was part of Exercise 3 of §IV.1, but, since this is an important 
fact, we’ll prove it here in case you skipped that exercise. As usual (see the 
proofs of Lemma 6 and 7 and the remark right before the statement of 
Lemma 6), we can easily reduce to the case A = 0. 

Thus, suppose f(X) = 1+ >4,X', ord, a, > 0, ord, a,—> 00, g(X) = 
1 + > 5,X'. Since f(X)g(X) = 1, we obtain 


b; = —(b,- 1a, + by 22 tee t bya;- 4 + a;) for i = 1, 


from which it readily follows by induction on i that ord, b; > 0. Next, we 
must show that ord, b; > oo as i> oo. Suppose we are given some large M. 
Choose m so that i > m implies ord, a, > M. Let 


e = min(ord, a, ord, a2, ..., ord, a,) > 0. 


We claim that i > nm implies that ord, b, > min(M, ne), from which it will 
follow that ord, b, > co. We prove this claim by induction on 7. It’s trivial 
for n = 0. Suppose n = 1 and i > nm. We have 


By = — (by, + + Bi mOm + Bice Amer H+ + a). 
The terms 5,_,a; with j > m have ord,(b,_,a;)> ord, a; > M, while the 
terms with j < m have ord,(5,_,a;) > ord, b;_; + e« > min(M, (n — le) + € 
by the induction assumption (since i — j > (n — 1)m) and the definition of e. 


Hence all summands in the expression for b, have ord, > min(M, ne). This 
proves the claim, and hence the theorem for N = 0. 
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Now suppose N > 1, and the theorem holds for N — 1. Let A, < X be 
the first slope of the Newton polygon of f(X). Using Lemma 7, we find an « 
such that f(«) = 0 and ord, « = —A,. Let 


fo =so(t+ T+ eee Se) 


=1+ >Da/Xtel + XOX]. 


By Lemma 8, /,(X) converges on D(p*:). Let c = I/a, so that: f(X) = 
(1 — cX)f,(X). If the Newton polygon of f,(X) had first slope A,’ less than A,, 
it would follow by Lemma 7 that /,(X) has a root with p-adic ordinal —),’, 
and then so would f(X), which it is easy to check is impossible. Hence A,’ > 1, 
and we have the conditions of Lemma 6 (with f,, fj 4,’, and A, playing the roles 
of f, g, A,, and A, respectively). Lemma 6 then tells us that f,(X) has the same 
Newton polygon as /(X), minus the segment from (0, 0) to (I, A,). In addition, 
in the case when f (and hence f,) have last slope A, because f converges on 
D(p*), Lemma 6 further tells us that f,; must also converge on D(p’). 

Thus, f,(X) satisfies the conditions of the theorem with N replaced by 
N — 1. By the induction assumption, we can find an 4,(X) ei + XQ[X] of 
degree N — 1 and a series g(X)e1 + XQ([[X]] which converges and is 
nonzero on D(p*), such that 


h,(X) = fi(X)-g(%). 


Then, multiplying both sides by (1 — cX) and setting A(X) = (1 — cX)h,(X), 
we have 


WX) = f(X)-8(X), 


with h(X) and g(X) having the required properties. 

Finally, suppose that h(X)¢1+ XQ[X] is another polynomial of 
degree N such that A(X) = f(X)g,(X), where g,(X) converges and is non- 
zero on D(p*). Since h(X)g(X) = S(X)g(X)e1(X) = h(X)g1(X), unique- 
ness of h(X) follows if we prove the claim: hg = hg, implies that h and h 
have the same zeros with the same multiplicities. This can be shown by 
induction on N. For N = 1 it is obvious, because h(x) = 0<A(x) = 0 for 
x € D(p*). Now suppose N > 1. Without loss of generality we may assume 
that —/ is ord, of a root « of h(X) having minimal ord,. Since « is a root of 
both h(X) and h(X) of minimal ord,, we can divide both sides of the 
equality h(X)g(X) = h(X)g,(X) by (1 — X/a), using Lemma 8, and thereby 
reduce to the case of our claim with N replaced by N — 1. This completes the 
proof of Theorem 14. O 


Corollary. Ifa segment of the Newton polygon of f(X)€1 + XQ{[[X]] has finite 
length N and slope A, then there are precisely N values of x counting 
multiplicity for which f(x) = 0 and ord, x = —A. 
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Another consequence of Theorem 14 is that a power series which converges 
everywhere factors into the (infinite) product of (1 — X/r) over all of its roots 
r, and, in particular, if it converges everywhere and has no zeros, it must be a 
constant. (See Exercise 13 below.) This contrasts with the real or complex 
case, where we have the function e* (or, more generally, e”, where / is any 
everywhere convergent power series). In complex analysis, the analogous 
infinite product expansion of an everywhere convergent power series in terms 
of its roots is more complicated than in the p-adic case; exponential factors 
have to be thrown in to obtain the ‘Weierstrass product” of an ‘‘entire” 
function of a complex variable. 

Thus, the simple infinite product expansion that results from Theorem 14 
in the p-adic case is possible thanks to the absence of an everywhere conver- 
gent exponential function. So in the present context we’re lucky that exp, has 
bad convergence. But in other contexts—for example, p-adic differential 
equations—the absence of a nicely convergent exp makes life very compli- 
cated. 


EXERCISES 


1. Find the Newton polygon of the following polynomials: 

(i) 1 — X + px? (ii) 1 — X3/p? (iii) 1 + X2 + pX* + p?Xx® 
(iv) >P-1 1X*- 2 (v) ( — X)U — px)U — p?.X) (do this in two ways) 
(vi) [1221 (1 — iX). 

2. (a) Let f(X¥) €1 + XZ,[X] have Newton polygon consisting of one segment 
joining (0, 0) to the point (n, m). Show that if 1 and m are relatively prime, 
then f(X) cannot be factored as a product of two polynomials with coefficients 
in Z,. 

(b) Use part (a) to give another proof of the Eisenstein irreducibility criterion 
(see Exercise 14 of §1.5). 

(c) Is the converse to (a) true or false, i.e., do all irreducible polynomials have 
Newton polygon of this type (proof or counterexample)? 


3. Let f(X)€1 + XZ,[X] be a polynomial of degree 2”. Suppose you know 
that, whenever « is a reciprocal root of f(X), so is p/x (with the same multipli- 
city). What does this tell you about the shape of the Newton polygon? Draw 
all possible shapes of Newton polygons of such f(X) when n = 1, 2, 3, 4. 


4. Find the Newton polygon of the following power series: 
(i) Do Xx?" (ii) SZ o(pXY +X") (iii) Tio i X! 
(iv) dro Xi! (v) Ch — pX*)/(1 — p?X) (vi) (1 — p?X)/(1 — pX) 
(vii) TT20(1 — piX) (viii) Dio pY71X' 

5. Show that the slopes of the finite segments of the Newton polygon of a power 
series are rational numbers, but that the slope of the infinite segment (if there 
is one) need not be (give an example). 

6. Show by a counterexample that Lemma 7 is false if we omit the condition 


that the line through (0, 0) with slope A, pass through a point (/, ord, aj), 
i> 0. 
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Suppose f(X)e€ 1 + XQO[[X]] has Newton polygon which is the degenerate case (3), 
Le., a vertical line through (0, 0). In other words, if the vertical line through (0, 0) is 
rotated counterclockwise at all, it passes above some points (i, ord, a;). Prove that 
J (x) diverges for any nonzero x EQ. 


. Let f(X) = 14+ 52, a;X!'e1 + XQ[[X]] converge in D(p*) where A is a rational 


number. Prove that max, < pipy| f(x)|p is reached when |x|, = p’*, i.e., on the “cir- 
cumference,” and that this maximal value of f(x) has p-adic ordinal equal to 


min; =0,1,--.(ordp a id), 


i.e., the minimum distance (which may be negative) of the point (i, ord, a,) 
above the line through (0, 0) with slope A. 


Let f(X) = So aX'e Z,[[X]]. Suppose that f(X) converges in the closed 
unit disc D(1). Further suppose that at least two of the a, are not divisible by 
p. Prove that f(X) has a zero in D(1). 


. Let f(X) be a power series which converges on D(r) and has an infinite 


number of zeros in D(r). Show that f(X) is identically zero. 
Prove that E,(Y) converges only in D(1~) (i.e., not in D()). 


Let g(X) = A(X)/fCX), where g(X)e1 + XQ[LX]] has all coefficients in 
D(1), and where ACX) and f(X)eE1 + XOQLX] are polynomials with no 
common roots. Prove that A(X) and f(X) also have all coefficients in D(1). 


Suppose that f(X) € 1 + XQ[LX]] converges on all of 2. For every A, let A,(X) 
be the A(X) in Theorem 14. Prove that 4, — fas A — © (i.e., each coefficient 
of A, approaches the corresponding coefficient of f). Prove that f has in- 
finitely many zeros if it is not a polynomial (but only a countably infinite set 
of zeros ry, ro, ...), and that f(X) = []721(1 — X/r). In particular, there is 
no nonconstant power series which converges and is nonzero everywhere (in 
contrast to the real or complex case, where, whenever ACX) is any everywhere 
convergent power series, the power series e” is an everywhere convergent 
and nonzero power series). 


CHAPTER V 


Rationality of the zeta-function of a set 
of equations over a finite field 


1. Hypersurfaces and their zeta-functions 


If Fis a field, let A% denote “n-dimensional affine space over F,”’ 1.e., the set 
of ordered n-tuples (x,, ...,x,) of elements x, of F. Let f(X%y, ..., X,)€ 
F[X, ..., X,] be a polynomial in the n variables X,, ..., X,. By the affine 
hypersurface defined by fin AZ, we mean 


Hy = (X15 + +s Xn) © AB | SQ, -- +5 Xn) = O}. 


The number n — 1 is called the dimension of H,. We call H, an affine curve if 
n = 2, i.e., if H, is one-dimensional. 

The companion concept to affine space is projective space. By n-dimen- 
sional projective space over F, denoted P?, we mean the set of equivalence 
classes of elements of 

AR** — {(0, 0, ..., 0)} 
with respect to the equivalence relation 
(Xo, X15 -- +> Xn) ~ (Xo, X15. Xn) TAG F* withx,) = Ax, i =0,...,n. 
In other words, as a set P$ is the set of all lines through the origin in A}*?. 

Az can be included in P? by the map (x, ..., x,)+> (1, %1, ..., X,)- 
The image of A} consists of all of P}? except for the “‘ hyperplane at infinity” 
consisting of all equivalence classes of (n + 1)-tuples with zero xo-coordinate. 
That hyperplane looks like a copy of P#~', by virtue of the one-to-one 
correspondence 


equiv. class of (0, x1, ..., X,) +> equiv. class of (x,, ..., X,)- 


(For example, if n = 2, the projective plane P? can be thought of as the 
affine plane plus the “line at infinity.’’) Continuing in this way, we can write 
P} as a disjoint union 


Ar" U Az U Ag-?U ...U Af U point. 
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By a homogeneous polynomial f(Xo, ..., Xn) € F[Xo, ..., Xn] of degree d 
we mean a linear combination of monomials of the same total degree d. For 
example, Xo° + Xo?X, — 3X,X2X3 + X,° is homogeneous of degree 3. 
Given a polynomial f(X%, ..., X,)¢ FLX, ..., X,] of degree d, its homo- 
geneous completion f(X», X;, ..., X,) is the polynomial 

X6L(X1/Xo, -- +> Xn/ Xo), 
which is clearly homogeneous of degree d. For example, the homogeneous 
completion of X32 — 3X,X.X3 + X, + 1 is the above example of a homo- 
geneous polynomial of degree 3. 

If #(Xo, ..., X,) is homogeneous, and if f(x, ...,x,) = 0, then also 
FO; .--,AX,) = 0 for Ae F*. Hence it makes sense to talk of the set of 
points (equivalence classes of (n + 1)-tuples) of P? at which f vanishes. That 
set of points A; is called the projective hypersurface defined by fin P2. 

If f(Xo, ..., X,) is the homogeneous completion of f(X,, ..., X,), then 
; is called the projective completion of H,. Intuitively, H; is obtained from 
H, by “throwing in the points at infinity toward which H, is heading.” For 
example, if H, is the hyperbola (say F = R) 

x? _ x2? 
a? b? 
then /(Xo, X1, X2) = X12/a? — X,2/b? — Xo, and H; consists of 
{(, Xi, X2)|Xi7/a? — X27/b? = 1} {O, 1, X2)|X2 = +4/a}, 
i.e., H, plus the points on the line at infinity corresponding to the slopes of 
the asymptotes. 

Now let XK be any field containing F. If the coefficients of a polynomial are 

in F, then they are also in K, so we may consider the “ K-points” of Hy, ie., 


HAK) = {%, ---s Xn) EAL | f(%1, «--5 Xn) = Of. 


If /(Xo, ..., Xn) is homogeneous, we similarly define H;(K). 

We shall be working with finite fields F = F, and finite field extensions 
K = F,». In that case H,(K) and A,(K) consist of finitely many points, since 
there are only finitely many (namely, g‘") n-tuples in all of A (and only 
finitely many points in P?). In what follows H, (or ;) will be fixed through- 
out the discussion. In that case we define the sequence N,, No, Ng, ... to be 
the number of F,-, F,2-, F43-, ... -points of H, (or Hy), i.e., 


N, = HAF): 


Given any sequence of integers such as {N,} which has geometric or number 
theoretic significance, we can form the so-called “generating function” 
which captures all the information conveyed by the sequence {N,} in a 
power series. This is the ‘“‘zeta-function,” which is defined as the formal 
power series 


= 1, 


exp( > N.T"'|s) € QO[[7]]. 
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We write this function as Z(H,/F,; T), where F, indicates what the original 
field F was. Note that the power series Z(H,/F,; T) has constant term 1. 
Before giving some examples, we prove a couple of elementary lemmas. 


Lemma 1. Z(H,/F,; T) has coefficients in Z. 


Proor. We consider the K-points P = (x,, ..., X,) of H, (K a finite extension 
of F,) according to the least s = 5g for which all x;€ Faso. If Pp=(X1j, -- +5 Xnj)s 
j= 1, ..., 50, are the “conjugates” of P, i.e., Xi1, ..., Xis) are the conjugates 
of x; = x;, over F,, then the P, are distinct, because if all of the x, are left 
fixed by an automorphism o of F,* over F,, it follows that they are all in a 
smaller field (namely, the ‘fixed field” of o: {x € Fas | o(x) = x}). 

Now let’s count the contribution of P,, ..., P,, to Z(H,/F,; T). Each of 
these points is an F,s-point of H, precisely when Fs > Fo, i.e., when so|s (see 
Exercise | of §I1I.1). Thus, these points contribute so to N,,, Naso, Nasos - - «>» 
and so their contribution to Z(H,/F,; T) is: 


exp( >. soT|j50) = exp(—log(1 — T*%c)) = = - > T%o, 
j=1 oe tae = 


The whole zeta-function is then a product of series of this type (only finitely 
many of which has first J-term with degree <5 ), and so has integer co- 
efficients. oO 


Remark. Note that a corollary of the proof is that the coefficients are 
nonnegative integers. 


Lemma 2. The coefficient of T’ in Z(H,/F,; T) is < q™. 


Proor. The maximum value for N, is q™ = + Aj,,. The coefficients of 
Z(H,/F,; T) are clearly less than or equal to the coefficients of the series with 
N, replaced by q"*. But 


exp( > onT|s) = exp(—log(1 — q*T)) = 1/1 — g*T) = > ge. O 


As a simple example, let’s compute the zeta-function of an affine line 
L = Hx, © Aj,. We have N, = 9°, and so 


Z(L/F ai T) = exp(S ¢T'/s) = exp(—log(t ~ 97) = pa 


The zeta-function is defined analogously for projective hypersurfaces, 
where now we use 


N, def +(A7(F,*)). 
For example, for a projective line Z we have N, = q* + 1, and so 


Z(L/F,;T) = exp(X(q°T'/s + T*/s)) 


= exp(—log(l — qT) — log(1 — T)) = Esser 
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It turns out that it’s much more natural to work with projective hypersurfaces 
than with affine ones. 

For example, take the unit circle X,? + X,? = 1, whose projective 
completion is A;, f= X,2 + X22 — X,2. It’s easier to compute Z(A;/F,; T) 
than Z(4,/F,; T). (We’re assuming that p=char F, # 2.) Why is it easier? 
Because there is a one-to-one correspondence between A;(K) and Z(K) (Z 
denotes the projective line). To construct this map, project from the south 
pole onto the line X¥, = 1, as shown in Figure 1. A simple computation gives: 


x2 


Figure V.1 


x, = 4t/(4 + 17), x2 = (4 — t*)/(4 + £7), t = 2x,/(x2 + 1). This map goes 
bad in the ¢-to-x direction if 2 = —4, i.e., for 2 values of ¢ if g’ = 1 (mod 4) 
and no values of ¢ if g° = 3 (mod 4) (see Exercise 8 of §III.1). It goes bad in 
the other direction when x, = —1, x, = 0. But if we take the projective 
completions and let (Xo, Xi, X2) be coordinates for the completed circle 
and (X,’, X,’) for the completed line, then it is easy to check that we have a 
perfectly nice one-to-one correspondence given by 


(Xo, X1') > (4%0'? + 1", 4X0'%1', 4X0’? — 1); 
(Xo, X1, Xe) FH (Kg + Xo, 2x,) if (Xo + Xo, 2X1) ¥ (0,0), and (0, 1) otherwise. 


The reader should carefully verify that this does in fact give a one-to-one 
correspondence between the projective line and the set of equivalence classes 
of triples (xo, 1, Xa) satisfying x,2 + x22 — x92 = 0. Thus, since N,'is the 
same for A; and I, we have 

Z(AG/F,; T) = Z(L/F,; T) = 1/( — TQ — 4T)). 


If we wanted to know Z(H,/F,;T), f = X,2 + X.? — 1, we'd have to 
subtract from WN, the points “at infinity” on A;, i.e., those for which x,? + 
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Xq”2 = Xo” and xX) = 0. There are 2 such points whenever —1 has a square 
root in F, and no such points otherwise. 


Case (1). gq = 1 (mod 4). Then —1 always has a square root in F,+ (see 
Exercise 8 of §III.1), and 


N, = N, - 2, 
am — ZOA/F 37) _ WMG -1T)0A—49M))_ 1-T 
Zk 7) exp( 3, 2r"/s) id — T)* 1—qT 


Case (2). q = 3(mod 4). Then it is easy to see that N, = N, if s is odd, 
and N, = N, — 2 if s is even, so that 


Notice that in all of these examples, as well as in the examples in the 
exercises below, the zeta-function turns out to be a rational function, a ratio 
of polynomials. This is an important general fact, which Dwork first proved 
in 1960 using an ingenious application of p-adic analysis. 


Theorem (Dwork). The zeta-function of any affine (or projective—see Exercise 
5 below) hypersurface is a ratio of two polynomials with coefficients in Q 
(actually, with coefficients in Z and constant term I—see Exercise 13 
below). 


The rest of this chapter will be devoted to Dwork’s proof of this theorem. 

We note that zeta-functions of hypersurfaces can be generalized to a 
broader class of objects, including affine or projective ‘‘algebraic varieties,” 
which are the same as hypersurfaces except that they may be defined by more 
than one simultaneous polynomial equation. Dwork’s theorem holds for 
algebraic varieties (see Exercise 4 below). 

Dwork’s theorem has profound practical implications for solving systems 
of polynomial equations over finite fields. It implies that there exists a finite 
set of complex numbers @,,..., ,, 8;,..., 8, such that for alls = 1, 2, 3,... 
we have N, = >t_, %° — >¥., 8° (see Exercise 6 below). In other words, 
once we determine a finite set of data (the a, and £,)—and this data is already 
determined by a finite number of N,—we’ll have a simple formula which 
predicts all the remaining N,. Admittedly, in order to really work with this in 
practice, we must know a bound on the degree of the numerator and denomi- 
nator of our rational function Z(H/F,; T) (see Exercises 7-9 below for more 
details). Actually, in all important cases the degree of the numerator and 
denominator, along with much additional information, is now known about 
the zeta-function. This information is contained in the famous Weil Conjectures 
(now proved, but whose proof, even in the simplest cases, goes well beyond 
the scope of this book). 
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V_ Rationality of the zeta-function of a set of equations over a finite field 


The rationality of the zeta-function was part of this series of conjectures 
announced by A. Weil in 1949. Dwork’s proof of rationality was the first 
major step toward the proof of the Weil Conjectures. The final step, Deligne’s 
proof in 1973 of the so-called ‘‘ Riemann Hypothesis” for algebraic varieties, 
was the culmination of a quarter century of intense research on the conjectures. 

In the case of a projective hypersurface H; in n-dimensional space that is 
smooth (i.e., for which the partial derivatives of f with respect to all the 
variables never vanish simultaneously), the Weil Conjectures say that the 
following: 


(i) Z(AG/Fy: T) = PIT)**( — TIA — aT) — @?T)---(1 — ge“), 
where P(T)e1 + TZ[T] has degree B, where 8 is a number related to the 
“topology” of the hypersurface. (called its ‘Betti number;” when 4; is a 
curve, this is twice the genus, or “number of handles,”’ of the corresponding 
Riemann surface). Here the + 1 means we take P(T) if n is even and 1/P(T) 
if n is odd. 

(ii) If @ is a reciprocal root of P(T), then so is qg”~*/e. 

(iii) The complex absolute value of each of the reciprocal roots of P(T) is 
q”~ 2, (This is called the ‘‘ Riemann Hypothesis” part of Weil’s conjectures, 
out of analogy with the classical Riemann Hypothesis for the Riemann 
zeta-function—see Exercise 15 below.) 


EXERCISES 
1. What is the zeta-function of a point? What is Z(Ag,/F,; T)? 
2. Compute Z(P£,/F,; T). 


3. Let f(Xi, ..., Xn) = Xn + 2C%, ..., Xn-1), where geF (Xi, ..., Xn-11. 
Prove that 


Z(H,/Fq;T) = Z(AG,*/Fa3 T). 


4. Let f(y, ..., Xn), fol X, 0, Xn) ACM, .-, Xn) © FLX, .-, Xn], and 
let Ais, .o.....s(Fy) © Af,s be the set of n-tuples of elements of F,* which 
satisfy a// of the equations f, = 0,i = 1,2,...,7r 


Aisi ta... rFq ) 5 im = {(x1, wey Xn) © Abys LAG, «2-5 Xn) = °° 
= fix, «+, Xn) = O}. 
Such an H is called an (affine) algebraic variety. Let VN; = +A (F,s) (where H 
is short for Hiy,,..., r,»), and define the zeta-function as before: Z(H /Fq; T) &, 


exp(>1 N.7°/s). Prove that Dwork’s theorem for affine hypersurfaces 
implies Dwork’s theorem for affine algebraic varieties. 


5. Prove that if Dwork’s theorem holds for affine hypersurfaces, then it holds 
for projective hypersurfaces. 
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6. 


7 


8. 


11. 


12. 


13. 


14. 


15. 


Exercises 


Prove that Dwork’s theorem is equivalent to: There exist algebraic complex 
numbers a, ..., &, 6;,--., B,, such that each conjugate of an « is an «, each 
conjugate of an « is an «, each conjugate of a B is a B, and we have: 


N,= > af — > Bs, for alls = 1,2,3,.... 
=1 t=1 
It is known that the zeta-function of a smooth cubic projective curve E = Ay 
(thus, dim £ = 1, deg f = 3; Eis called an “elliptic curve”) is always of the 
form: (1 + aT + qT?)/[(1 — T)( — qg7)] for some ae Z. Show that if you 
know the number of points in E(F,), you can determine: (1) a, and (2) 
++E(F,s) for any s. 
Using the fact stated in the previous problem, find Z(A;7/F,; 7) when 
f(%, X2) is: 

(i) X22 — X13 — landg = 2 (mod 3); 
(ii) X2? — X13 + X, andg = 3(mod4), alsog = 5, 13, and 9. 
Suppose we know that Z(H/F,; T) is a rational function whose numerator 
has degree m and whose denominator has degree n. Prove that N, = +-H(F,:) 
for s = 1, 2,3, ...,m + n uniquely determine all of the other N,. 
Compute Z(A,/F,; T) when H; is the 3-dimensional hypersurface defined by 

X1X4 = XoX3 = 1. 

Compute Z(H;/F,; T) and Z(4;/F,; T) (f = homogeneous completion of f) 
when H, is the curve: 

(i) X,X2 = 0 (ii) X,X2(X%, + X2. + 1) = 0 (iii) X22 — Xi? = 1 
(iv) X,? = X,3 (v) X,? = X,3 + X,?, 

Lines in P* are obtained by intersecting two distinct hyperplanes, i.e., a line 
is the set of equivalence classes of quadruples which satisfy simultaneously 
two given linear homogeneous polynomials. Let N, be the number of lines in 


P2,s- Using the same definition of the zeta-function in terms of the N; as 
before, compute the zeta-function of the set of lines in projective 3-space. 


Using Exercise 12 of §I1V.4, prove that Dwork’s theorem, together with 
Lemma 1 above, imply that Dwork’s theorem holds with “coefficients in Q”’ 
replaced by ‘coefficients in Z and constant term 1.” 


Let H, be given by X,? = X,° + 1, and let p = 3 or 7 (mod 10). Assuming the 
Weil Conjectures for the genus 2 curve H;, prove that 

1+ p?T* 
(1—T)G — pT) 


Let A; be a smooth projective curve. Assuming the Weil Conjectures for 
Z(H;/F,; T) (which were proved for curves much earlier than for the general 
case), show that all of the zeros of the complex function of a complex variable 


F(s) =, Z(Aj/Fa3 475) 


Z(Aj/F,; T) = 


are on the line Re s = 4. This explains the name ‘‘ Riemann Hypothesis” for 
part (iii) of the Weil Conjectures. 
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V_ Rationality of the zeta-function of a set of equations over a finite field 


2. Characters and their lifting 


An Q-valued character of a finite group G is a homomorphism from G to the 
multiplicative group Q* of nonzero numbers in ©. Since gréerofs = | for 
every g &G, it follows that the image of G under a character must be roots 
of unity in 2. For example, if G is the additive group of F,,, if e is a pth root 
of 1 in Q, and if @ denotes the least nonnegative residue of ae€ F,, then the 
map a+> e* is a character of F,. In what follows, we shall omit the tilde and 
simply write a@+> «*. Ife # 1, then the character is “nontrivial,” i.e., its image 
is not just 1. 

If F, is a finite field with q = p* elements, we know that there are s = 
[F,:F,] automorphisms oo, ..., o,-, of F, given by: o,(a) = a” for ae F, (see 
Exercise 6 of §III.1). If ae F,, by the trace of a, written Tr a, we mean 


s-1 
Tras > ofa) =a+a?+a"4---+ a7", 
1=0 
It is easy to see that (Tra)? = Tra, ie., TraeF,, and that Tr(a + 5b) = 
Tr a + Tr b. It then follows that the map 


aQt> eta 


is an Q-valued character of the additive group of F,. 

Recall that for every ae F, we have a unique Teichmiiller representative 
te Q, lying in the unramified extension K of @, generated by the (¢ — 1)st 
roots of 1, such that ¢? = ¢ and a is the reduction of ¢ mod p. Our purpose 
in this section is to find a p-adic power series ©(T) whose value at T = t 
equals «T™*. (More precisely, we'll get the value of @(T)@(T”)@(T”’)- -- 
©(T?*~*), where q = p’, to be e™* at T = 1.) 

Now fix a € F,*, and let t € K be the corresponding Teichmiiller representa- 
tive. Let Tr, denote the trace over Q, of an element of K, i.e., the sum of its 
conjugates. Then for our Teichmiiller representative t we have (see Exercise | 
at the end of §4 below) 


Tret=tbertrrg.-- +t? eZ, 
and the reduction mod p of Trx t¢ is 
ata?+a"+4.--+ a") = Trae Fy. 


Hence, since « raised to a power in Z, depends only on the congruence class 
mod p, we can write: e™? = eT xt, 

Let \ = e — 1. We have seen that ord, A = 1/(p — 1) (see Exercise 7 of 
§I1I.4). We want a p-adic expression in ¢ for 


Ze... s-1 
(1 + ayer teres eet d L gTre, 
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2 Characters and their lifting 


The naive thing to do would be to let 
g(T) = (1 +a" = > T(T — 1)-- “anit ) ye 


and then take the series g(T)g(T")g(T*’) - - - an But the problem is: 
how to make sense out of the infinite sum g(T) for the values T = ¢ that 
interest us. Namely, as soon as ¢¢ Z,, i.e., its residue @ isn’t in F,, then 
clearly |¢ — i|, = 1 for all ie Z; then 


pe ee er ee ee 


which does not —> oo. 
What we have to do is use the better behaved series F(X, Y) introduced at 
the end of §IV.2: 


F(X, Y) = (1 + YL + Yeyer-me, 4 yeryar?-xee? 
ea So iii a 


where recall that each term on the right is understood in the sense of the 
corresponding binomial series in Q[[X, Y]]. We now consider F(X, Y) asa 
series in X for each fixed Y: 


F(X, Y) = > (x > ann ¥*), Ann © Zp, 
n=0 m=n 


where we’re using the fact that a,,,, # 0 only when m > n; this follows be- 
cause each term in the series Bix" _ xp"- 45 ,( Y?"), i.e., 


x bs ert x = Xr > Gas = x 1 _ 
— po a ae pare 1 eye pe bar I + 1 


has the power of X that appears less than or equal to the power Y*®" of Y. 
Recall that A = e — 1, and that ord, A = 1/(p — 1). We set 


O(T) = F(T, A) = > a,T", 


with a, = >m=n4m.n4”. Clearly ord, a, > n/(p — 1), since each term in a, 
is divisible by A”. Also, since the field @,(e) = Q,(A) is complete, we 
have a, € Q,(e), and @(T) € Q,(e)[[T]]. Moreover, ©(T) converges for te 
D(p"'?--), because ord, a, = n/(p — 1). 

For our fixed ¢ we now consider the series 


(LYRE = Bre at (YP). 
It is easy to prove the following formal identity in Q[[ Y]]: 


(1+ yyitet +o! = F(t, Y)F(t?, Y)--- F(t?*~3, Y). 
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V_ Rationality of the zeta-function of a set of equations over a finite field 


Namely, after trivial cancellations, the right hand side reduces to 
(1 + Yyitere ame 6| + yer -9/(] + yr2yer? ** — tip? 
x (L + Yryer**-w%... 


Since t?° = ft, we get what we want. 
Thus, if we substitute ¢ for T in O(T)O(T?) - --@(T?*~*), we obtain 


F(t, F(t, A)» + F(t?°~*, A) = (LF ayetert en tee 


= ert a 


To conclude, we have found a nice p-adic power series Q(T) = > a,T" € 
Q,(e)[[T]], satisfying ord, a, => n/(p — 1), such that the character a+> e7T? 
of F, can be obtained by evaluating @(7)-©(7”)---@(T?*~*) at the Teich- 
miiller lifting of a. © can be thought of as “‘lifting” the character of F, toa 
function on Q (more precisely, on some disc in Q, which includes the closed 
unit disc, and hence all Teichmiiller representatives). 

Liftings such as @ are important because concepts of analysis often apply 
directly only to p-adic fields, not to finite fields. If a situation involving finite 
fields—such as zeta-functions of hypersurfaces defined over finite fields— 
can be lifted to p-adic fields, we can then do analysis with them. Notice how 
important it is that our lifting © converge at least on the closed unit disc 
(rather than, say, only on the open unit disc): the points we’re mainly interested 
in, the Teichmiiller representatives, lie precisely at radius 1. 


3. A linear map on the vector space of power series 


Let R denote the ring of formal power series over Q in n indeterminates: 
R ar Q(X, Xo, ry X,) I. 


def 
A monomial X,%:X,"2--- X,%. will be denoted X“, where u is the n-tuple of 
nonnegative integers (u,, ...,u,). An element of R is then written > a,X”*, 
where u runs through the set U of all ordered n-tuples of nonnegative integers, 
and where a, € Q. 
Notice that R is an infinite dimensional vector space over 2. For each 
G & R we define a linear map from R to R, also denoted G, by 


rt>Gr, 


i.e., multiplying power series in R by the fixed power series G. 
Next, for any positive integer q (in our application g will be a power of 
the prime p), we define a linear map T,: R— R by: 


r= > aX > T(r) = > auX", 


where qu denotes the n-tuple (qu,, qua, ..., gun). For example, if n = 1, this 
is the map on power series which forgets about all X’-terms for which / is 
not divisible by g and replaces X’ by X/ in the X’-terms for which g|/. 
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3 A linear map on the vector space of power series 


Now let fog = Ta° G: R> R. If G = Suev SwX™, then ¥,,¢ is the linear 

map defined on elements X* by 
YeAX*) = TD tek") =D ser-aX™ 
weU veU 

(Here if the n-tuple qv — u is not in U, ie., if it has a negative component, 
we take g,,-, to be zero.) 

Let G,(X) denote the power series G(X = Swev SyX%. The following 
relation is easy to check (Exercise 7 below): 


GoT, = Ty° Gy = Vocy 
We define the function | | on U by: |u| = >f-, u. Let 


Ro a4o = > g,X”e R|forsome M > 0, ord, g, = M|w| forallwe u}. 
weU 

It is not hard to check that Rp is closed under multiplication and under the 
map G+>G,. Note that power series in Ry must converge when all the 
variables are in a disc strictly bigger then D(1). An important example of a 
power series in Ry is @(aX”), where X” is any monomial in X,,..., X, and 
ais in D(1) (see Exercise 2 below). 

If V is a finite dimensional vector space over a field F, and if {a,,} is the 
matrix of a linear map A: V > V with respect to a basis, then the trace of A 
is defined as 


Tr A det > Qiis 


i.e., the sum of the elements on the main diagonal (this sum is independent of 
the choice of basis—for details on this and other basic concepts of linear 
algebra, see Herstein, Topics in Algebra, Ch. 6). (The use of the same symbol 
Tr as for the trace of an element in F, should not cause confusion, since it will 
always be clear from the context what is meant.) If F has a metric, we can 
consider the traces of infinite matrices A, provided that the corresponding 
sum 57.1 a, converges. 


Lemma 3. Let Ge Ro, and let ¥ = ¥,¢. Then Tr(¥) converges for s = 


1,2,3,..., and 
(q° — 1)" Tr(¥*) = 2, G(x)- G(x?) G(x%) --- G(x"~*), 


i i i. veial sont 
where X = (X4,...,Xq)3 X7 = (x4%,..., X,"); and x#~! = 1 means xf~! 
= 1 forj = 1,2,...,n. 


Proor. We first prove the lemma for s = 1, and then easily reduce the gen- 
eral case to this special case. Since V(X") = Syeu Bqv—-uX”", we have 


Tr Y a >. Biq-1)us 


ueU 


which clearly converges by the definition of Ro. 
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V_ Rationality of the zeta-function of a set of equations over a finite field 


Next, we consider the right-hand side of the equation in the lemma. First 
of all, for each i = 1, 2, ...,, we have 
> x = fe -— 1, ifg — 1 divides w, 


xen 0, otherwise. 


xg-te1 


(See Exercise 6 below.) Hence, 


> x =TT( > x) = {4 ”, itq ivides w 


xe0" | t \ye*Tly otherwise. 
Thus, 
> G)=S ey > w= @- 1" DS gee =@—- "Tr, 
x@-l=1 weU xd-l=1 ueU 


which proves the lemma for s = 1. 
Now suppose that s > 1. We have: 


WS = Tyo GoTyo Go 8-2 = Tyo Ty° Gao Go W8-? 
= T2°G-G,o YS"? = T,20 T,°(G-G,),G ° ¥s-3 
= T° G-G,-Gyzo ¥8-3 = ++. = Tyo G- Gy: Gyz +++ Gys-1 
SS Wate Ganseee 


Thus, replacing g by g° and G by G-G,-G,2--- G,s-1 gives the lemma in the 
general case. O 


If Ais anr x r matrix with entries in a field F, and if T is an indeterminate, 
then (1 — AT) (where | is the r x r identity matrix) is anr x r matrix with 
entries in F[T]. It plays a role in studying the linear map on F” defined by A, 
because for any concrete value te F of T, the determinant det(l — Af) 
vanishes if and only if there exists a nonzero vector v € F’ such that 0 = 
(1 — Adv = v — tA, i.e., Av = (1/2)v, in other words, if and only if 1/t is an 
eigen-value of A. If A = {a,;}, we have 


n 
det(l — AT) = > 6,7", 
m=0 


where 


bm = (—1)" ya SEN) Ay, (uy) Fug.c(ug) * °° Fug. o(tim)* 
1 <u, <+*'<u,, <r 
oa permutation of the u’s 

(Here sgn(c) equals +! or —! depending on whether the permutation o is a 
product of an even or odd number of transpositions, respectively.) 

Now suppose that A = {a,,}7°;., is an infinite “square” matrix, and suppose 
F = Q. The expression for det(1 — AT) still makes sense as a formal power 
series in Q[[7]], as long as the expression for b,, which now becomes an 
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3 A linear map on the vector space of power series 


infinite series (i.e., the condition “<r” is removed from the u,), is convergent. 

We apply these notions to the case when A = {g ,-uhu,veu iS the “matrix” 
of YW = 7,°G, where GE Ro, ie., ord, g, = M|w|. We then have the 
following estimate for the p-adic ordinal of a term in the expression for b,: 


Ord Pecan ay Soom aig" Beedini sual 

M [|go(us) — us| +|go(u2) — ve] +--+ +|go(m) — uml] 
M{> qlo(u)| — > ull = M@ —- 1) > ul. 

(Notice that, if G is a series in n indeterminates, then each uy, is an n-tuple of 


nonnegative integers: u, = (Wi, .--, Min), and |u,| = >. uj;.) This shows 
that 


IV 


IV 


ord, bn > 0 asSm->co 
and also that 


I 
im O8dr bm > 00 as m —> 00, 


The latter relationship holds because, if we take into account that there are 
only finitely many u’s with a given |u|, we find that the average |u| as we run 
over a set of distinct u;, i.e., (1/m) >7, |u|, must approach oo. 

This proves that 


det(l — AT) = > 6,7" 
m=0 


is well-defined (i.e., the series for each 6,, converges), and has an infinite 
radius of convergence. 

We now prove another important auxiliary result, first for finite matrices 
and then for {g,, -,}. Namely, we have the following identity of formal power 
series in Q([7]]: 


det(l — AT) = exp,(~ S THAT's). 


To prove this, we first recall from the theory of matrices (Herstein, Ch. 6) 
that the determinant and trace are unchanged if we conjugate by an invertible 
matrix: At> CAC“, i.e., they are invariant under a change of basis. More- 
over, over an algebraically closed field such as Q, a change of basis can be 
found so that A is upper triangular (for example, in Jordan canonical form), 
in other words, so that there are no nonzero entries below the main diagonal. 
So without loss of generality, suppose A = {a,;}/,;-1 is upper triangular. Then 
the left hand side of the above equality takes the form [][f-, (1 — aT). 
Meanwhile, since Tr(A’) = 57_, a5, the right hand side is 


oO r 


exp,( > > ay°T'/s = i exp»(— > (a7 


Ss=1li=1 


= [ Lexp,tlog,(t rast a,;T )) = a Cl = aT). 
t=% tei 
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V_ Rationality of the zeta-function of a set of equations over a finite field 


We leave the extension to the case when 4 is an infinite matrix as an 
exercise (Exercise 8 below). 
We summarize all of this in the following lemma. 


Lemma 4. If G(X) = dweu8wX” € Ro and ¥ = T, G, so that ¥ has matrix 
A = {8qy-ubv,ueu, then the series det(1 — AT) is a well-defined element of 
Q[[T]] with infinite radius of convergence, and is equal to 


expy| - > Tr(A5)TS/s \ 


4. p-adic analytic expression for the zeta-function 


We now prove that for any hypersurface H, defined by f(Xq, ..., X,) € 
F[X, ..., X,], the zeta-function 
2(A,/F; T) € ZIT] < QUI] 


is a quotient of two power series in Q{[[7]] with infinite radius of convergence. 
(Alternate terminology: is p-adic meromorphic, is a quotient of two p-adic 
entire functions.) 

We prove this by induction on the number n of variables (i.e., on the 
dimension n — 1 of the hypersurface H,). The assertion is trivial if n = 0 
(i.e., H; is the empty set). Suppose it holds for 1, 2, ...,m — 1 variables. We 
claim that, instead of proving our assertion for 


Z(H,/F,;T) = exp(>N.T‘/s), 


it suffices to prove it for 
Z(H|F 3 T) a exp( > NT), 
s=1 


where 
N, = 


iq number of (x, ..., Xn) € Fas such that f(y, ..., X,) = 0 


and all of the x; are nonzero 
= number of (x,,..., X,) € Fj. such that f(x,,..., x,) = 0 
and x?-2=1, i=1,...,n. 
How does Z‘(H,/F,; T) differ from Z(H,/F,; T)? Well, 
Z(H,/Fq; T) = Z'(H,/F,; T)- exp(>(N. — N,')T*/s), 


and the exp factor on the right is the zeta-function for the union of the n 
hypersurfaces H; (i = 1,...,n) defined by f(X,,..., X,) = 0 and X; = 0. 
Note that H; either is a copy of (n — 1)-dimensional affine space given in 
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4 p-adic analytic expression for the zeta-function 


Az, by the equation x; = 0 (this is the case if f(X ,,..., X,,) is divisible by X;), 
or else is a lower dimensional ((n — 2)-dimensional) hypersurface. In the 
first case we know its zeta-function explicitly (see Exercise 1 of §1), and in the 
latter case we know that its zeta-function is meromorphic by the induction 
assumption. Now the zeta-function for the union of the H; is easily seen to 
be the product of the individual zeta-functions of the H;, divided by the 
product of the zeta-functions of the overlaps of H; and H; (i # j)—i.e., the 
hypersurface in a copy of Ag, ? defined by X; = X; = Oandf(X,,...,X,) = 
O—multiplied by the product of the zeta-functions of the triple overlaps, 
divided by the product of the zeta-functions of the quadruple overlaps, and 
so on. But all of these zeta-functions are p-adic meromorphic by the induction 
assumption or by the explicit formula for the zeta-function of affine space. 
Hence, if Z’ is proved to be p-adic meromorphic, it then follows that Z is 
p-adic meromorphic as well. See Exercises 4—5 of §1 for a similar argument. 

Fix an integer s > 1. Let g = p’. Recall that, if t denotes the Teichmiiller 
representative of ae F,:, then the pth root of 1 given by «7? has a p-adic 
analytic formula in terms of t: 


e™2 — Q(t)O(t?)O(t?”) - - - O(t?"~’). 
A basic and easily proved fact about characters (see Exercises 3-5 below) is 
that 
Saas (a if ue Fy 
XoEF qs q, ifu = 0; 


and so, if we subtract the x) = 0 term, 


: x 
eTM(xou) = {7 1, ifue Eqs 


a 
2 q—-1, ifu=0. 


XoEE Zs 
Applying this to u = f(x,, ..., x,) and summing over all x,, ..., X, € F¢s, 
we obtain 


ell(Xof (xy Seesee xy) @N,’ oe (qs = 1)". 


Now replace all of the coefficients in Xof(X1, ..., Xn) € FalXo, Xa, ---> Xal 
by their Teichmiiller representatives to get a series F(X, X;, ..., X,) = 
Dh aX%e Q[X, X,...,X,], where X™% denotes Xy%oX M1 --- XM, 
wWfe= (Wios Wits +++ Win). 


We obtain: 
q°N; = (qs = 1)" + > eTt(Xofl%yr...s x,)) 
XoXy A xn€E ys 
N 
=@= 1p > 2 [ [ oG@x0(@?x™)--- 
XoXo .-e xneQ i=1 
cS es ae 


: O(a?" - iyers - +»), 
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Note that, since /(X,, ..., X,) has coefficients in F,, q = p’, it follows that 
a," = a;. Now let 
N 
GX oa X,) za II @(a,X") O(a? XP") --- @(aP~ xP" *%), 
i=l 
so that 
GN. = (q — 1) + > G(x): G(x4)- G(x") «+ Gx"). 


Since the series O(a,” X""”:) are each in Ry (see Exercise 2 below), so is G: 
G(X, ..-, Xi) E Ro | QLNX, ..., Xa]. 
Hence, by Lemma 3, we have 
GN, = (gq — 1)" + @ — I)** Tr), 
ie., 


= den(ia smi D Xe »(" 3 Z ‘Ya sn -O TP), 
If we set (recall: A is the matrix of ‘V) 
A(T) = det(l — AT) = exp,{ — > THT} 
we conclude that 


Z'(H,|F 4; T) = exp, {dN T's} 


= [I ex p> goeorys\ ON, 
1=0 


n+1 «o c-v("7") 
x CT exo. > gg) TrC¥S)T s/s }| 
i=0 


7 n+1 
=[[a- giorno OT] AQT) PICT), 
i=0 eG 


By Lemma 4, each term in this “alternating product” is a p-adic entire 
function. 

This concludes the proof that the zeta-function is p-adic meromorphic. 
This result is the heart of the proof of Dwork’s theorem. In the next section 
we finish the proof that the zeta-function is a quotient of two polynomials. 


EXERCISES 


1. Let ¢e Q be a primitive (p* — 1)th root of 1. Prove that the conjugates of ¢ 
over Q, are precisely: 1, 1”, t?”, ...,¢?°"*. In other words, the conjugates of 
the Teichmiiller representative of a ¢ F, are the Teichmiiller representatives of 
the conjugates of a over Fp. 
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2. Let XY = X,"--- X,"s, and let ae D(1). Prove that O(aX”) € Ro. 


3. Let ai, ..., 0, be distinct automorphisms of a field K. Prove that there is no 
nonzero linear combination > a,c, such that > a,o,(x) = 0 for every xe K. 


4. Let ee © bea primitive pth root of 1. Prove that Sxep, e7* = O. 
5. Prove that 

y err = i 1, if € Fj. 

xek g—1, ifu=0. 

6. Prove that for any positive integers n and a, 


hy if n divides a; 
0, 


len, De J otherwise. 


7. Prove that Go 7, = T,° G, in the notation of §V.3. 
8. Extend the result 


det(1 — AT) = exp»( 5 TH(ANT*/s) 


to infinite matrices A, with a suitable hypothesis on convergence of Tr. 


9. A review problem. Let f(X) = Seo a:X'€ Fa[X] be a polynomial in one 
variable with coefficients in F,, gq = p’, and nonzero constant term. We are 
interested in the number WN of distinct roots of f(X) in F,. For each i = 0, 
1, ..., 2, let A, be the Teichmiiller representative of a,. Let « = 1 + A bea 
fixed primitive pth root of 1 in Q, and define ©(T7) as in §V.2. Let 


nor-l 
G(X, Y) = I i O(A?’ X'?! YP’), 
Prove that 
wetel,) y G(x, y). 
q x. yen 
x¢-ley¢-leal 
5. The end of the proof 


Dwork’s theorem will now follow easily once we prove the following criterion 
for a power series to be a rational function. 


Lemma 5. Let F(T) = >.> aiT' € K[[T]], where K is any field. For m, s > 0, 
let A, m be the matrix (a, +:4;$0 si.jsm: 


as as+1 as+2 sts Asam 
Gs41  As+a as+3 ttt Ase met 
4542 Asia as44 ttt As+m+2 9? 
Asim Ast+m+1 G@stm+2 °'* OAstom 
and let Nom = = det(A, m). Then F(T) is a quotient of two polynomials 
P(T 
F(T) =£9, pr), (rye KIT}, 


ary’ 
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if and only if there exist integers m > O and S such that N,,m = 0 whenever 
s>S&. 


Proor. First suppose that F(T) is such a quotient. Let P(T) = 5”, 6,T', 
Q(T) = >%.0 GT'. Then, since F(T)- Q(T) = P(T), equating coefficients of 
T' for i > max(M, N) gives: 


N 
> Gy 4;Cv—; = O. 
nh 


J 
Let S = max(M — N+ 1,1), and let m= AN. If s > S, we write this 
equation fori=s+N,s+N+1,...,5 + 2N: 


QCy + As4iCy-1 +++ + Asano = O 
Gs41Cn + As40Cn—-1 +°°* + As4n4i10o = 0 
OsanCn + Asan4i0n—-1 + °°* + AstanCo = 0. 


Hence the matrix of coefficients of the c;, which is A, y, has zero determinant: 
Nem = Nov =O fors 2S. 


Conversely, suppose that N,, = 0 for s = S, where m is chosen to be 
minimal with this property that NV, , = 0 for all s larger than some S. We 
claim that N,n-1 # 0 for alls > S. 

Suppose the contrary. Then some linear combination of the first m rows 
los Vis ++ +9 fm—1 Of Ag m Vanish in all but perhaps the last column. Let r,, be 
the first row having nonzero coefficient in this linear combination, i.e., the 
igth row r;,, can be expressed as 


Osligta + Cligg2 0+ + Om-ig-i%m-1 


except perhaps in the last column. In A,,,, replace r,, by r,, — (@ifige41 t-°° + 
Om -ig-1'm-1), and consider two cases: 


(1) ip > 0. Then we have a matrix of the form 


as as +1 eye: Qs +m 


As+m+1 


As+m Asim me Qs+42m 
and the boxed matrix has determinant N,.4.m-1 = 0. 


(2) io = 0. Then we have: 


' 
! 
' 
' 
i As42 AWsim+i }, 
' : : 
1 


As+m+1 ds42m 
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Now N,1.m-1 is the determinant of either of the boxed matrices. Since the 
determinant AN, ,, of the entire matrix is 0, either the determinant of the lower 
left boxed matrix is 0, or else 8 = 0, in which case also Ny41.m-1 = 0. 

Thus, in either case N,41.m-1 = 0, and, by induction, we may obtain 
Nym—1 = 0 for all s’ > s. This contradicts our choice of m to be minimal. 

But then for any s => S we have N, m = O and N,m-1 # 0. Hence there is 
a linear combination of the rows of A,, which vanishes, in which the co- 
efficient of the last row is nonzero. Thus, the last row of A,,,, foranys > Sisa 
linear combination of the preceding m rows. Hence any simultaneous 
solution to 


sum, + As41Um-1 +++ + As4mllo =0 


As4m—-Um + As4mUm—1 + °°* + As+am-1lo = 9 
is also a solution to 
Asimlm + Agemsilm-1 +++ + Asiamlo = 0, 
and, by induction, to every 
Am + As4iUm—-1 + +++ + AsimUo = 0 


for s = S. This clearly implies that 


(Sux). (Sax) 
1=0 i=l } 


is a polynomial (of degree < S + m). oO 


We now use Lemma 5 to prove the theorem. We must make use of the 
‘‘p-adic Weierstrass Preparation Theorem” (Theorem 14, §1V.4). In the form 
we need, it says that, if F(7) is a p-adic entire function, then for any R& there 
exists a polynomial P(T) and a p-adic power series F,(T) € 1 + TQ[[T]] which 
converges, along with its reciprocal G(T), on the disc D(R) of radius R, such 
that F(T) = P(T)-F,(T). Namely, in Theorem 14 let p* = R; since F is 
entire, it converges on D(p’). 

For brevity, let Z(7) = Z(H,/F,; 7). We know from §4 that we can 
write Z(7) = A(T)/B(T), where A(T) and B(T) are p-adic entire functions. 
Choose R > q"; for simplicity, take R = q?". If we apply the fact in the last 
paragraph to B(T), we may write B(T) = P(T)/G(T), where G(T) converges 
on D(R). Let F(T) = A(T)-G(T), which converges on D(R). Thus, 


F(T) = P(T)-Z(T). 


Let F(T) = S20 6,7T'€1 + TOL[[T]], P(T) = df20 GTie 1 + TQ[T], Z(T) 
= >2 .47'e 1 + TZ[[T]}]. By Lemma 2 of §1, we have 


\ailoo < qi". 
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Since F(T) converges on D(R), we also have for i sufficiently large: 
lp < Rh = go ™, 


Choose m > 2e. Then fix m. Let Asm = {As+i+j}0<i,j<m, a8 before, and 
Ny.m = det(As,m). We claim that for our m we have N,.m = 0 for s sufficiently 
large. By Lemma 5, this claim will imply that Z(T) is a rational function. 

Equating coefficients in F(T) = P(T)Z(T), we have: 


Dyee = Ase + CIGjre-1 + Cojre-g H+ + CpG. 


In the matrix A, ,, we add to each (j + e)th column—starting from the last 
and moving left until the eth column—the linear combination of the previous e 
columns with coefficient c, for the (j + e — k)th column. This gives us a 
matrix whose first e columns are the same as in A, », the rest of its columns 
have a’s replaced by the corresponding b’s, and which still has determinant 
N,,m- We use this form of the matrix to estimate |N, mlp- 

Since a,¢Z, we have |a|, < 1. Thus, |Memlp < (max;o5+elD;|p)™*17° < 
R-s™+1~-©) for s sufficiently large. Since R = q?", and m > 2e, this gives us: 
|Ns.mlp < Grint. 

On the other hand, a crude estimate based directly on A, m gives: |Ns.mlo < 

(m + 1) lqr’st 2mon +) a (m+ 1) Iq2nmen + Dgnsim +1), Multiplying together these 
two estimates, we see that the product of the p-adic norm and the usual 
absolute value of Nm is bounded by an expression which is less than | for s 
sufficiently large: 
(m+ 1) !g2™mm +) 
a 
for s sufficiently large. But N,., € Z, and the only integer n with the property 
that |n|-|n|, < lism = 0. Hence N, _ = 0 for s sufficiently large. 


Therefore, Z(T) is a rational function, and Dwork’s theorem is proved. 
| 


|Ne.mlp'|Ns,mlo < q7 sm +2). mm + 1) g20mm t Dgnsim + D = <1 
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Answers and hints for the exercises 


CHAPTER I §2 


3. 


Write |x + yl|¥ = {I(x + y)%|], use the binomial expansion and Property (3) of a 
norm to get an inequality for ||x + y||¥ in terms of max(||x|[, |[y{}), then take Nth 
roots and let N > oo. 


If xe F has the property that ||x|| < 1 and ||x — 1}| < 1, and if |) || is non-Archi- 
medean, then 1 = |[1 — x + x|| < max(||x — 1], ||x|]) < 1, a contradiction. Con- 
versely, suppose that || || is Archimedean. Then by definition, there exist x, ye F 
such that ||x + y|| > max(|jx|l, llyl]). Let « = x/(x + y), and show that ||«|| < 1 and 
la — 1} <1. 


. Suppose || ||, ~ || ||2. Let ae F be any nonzero element with |la||, 4 1, say |lall. 


> 1. Then there is a unique « such that ||a||, = |la|l3. Claim: ||x||, = ||x||$ for all 
x € F. If, say, there were an x with ||x||, > ||x/]3 (also suppose that ||x||, > 1), then 
choose large powers x” and a” such that ||x”/a"||, approaches 1; but then show that 
||x"/a"||, approaches 0, and hence the two norms are not equivalent. Finally, note 
that « > 0, or else we would not have || ||, ~ || ||2. (The converse direction in this 
exercise is easy.) 


. If p = 1, you get the trivial norm. If p > 1, you don’t get a norm at all; for example, 


choose N large enough so that p% > 2 and take x = 1, y = p% — 1. Thencheck that 
prrde(x+y) > pede + porary, 


. The sequence {p,"} approaches 0 in | |,, but not in | |,.. 


The hardest part is to prove the triangle inequality for | |*, « < 1. By supposing 
|x| > |y| and setting u = y/x, reduce to showing that 


-Il<u<il implies jl+ul?< 14 [ul 
which is true if 
O<u<il implies f(u)=1+u*—-(1+u*>0. 
Since f(0) = 0 and f(1) = 0, this follows by showing that f’(u) 4 0 on (0, 1). 
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9. Use Exercises 2 and 3: if ||n||,rch > 1, then the sequence {1/n'} approaches zero in 
ll llArch but not in | lnon-Arch: 


10. Show that the least positive integer of the form nN + mM must be a common 
divisor of N and M. 


11. 3, 7, 1, 1, 7, —2, 0, 0, 3, 2, —2,0, -1, -1,4. 


12-14, Prove the lemma: ord,(n!) = >[n/p'], where [ ] is the greatest integer function 
and the sum is over j > 1 (note that this is only a finite sum). 


15. 1/25, 25, 1, 1/25, 1/243, 1/243, 243, 1/13, 1/7, 1/2, 182, 1/81, 3, 2%~2™, 1/2. 
16. p does not divide its denominator when x is written as a fraction in lowest terms. 


17. Use Exercise 14. 


19. Use the “diagonal process,” as follows. Choose an infinite subsequence of integers 
with the same first digit, an infinite subsequence of that with the same first two digits, 
and so on. Then take the subsequence consisting of the first element from the first 
subsequence, the second element from the second subsequence, ..., the ith element 
from the ith subsequence, .... 


CHAPTER I §5 


Lp =Gs,)0 Pet aye 
+ (p—1—a))+(p—1-—a4,)p+--. 
2 (i) 44+0-741-7 45-7? 
(ii) 2+0-5+41-5? 43-59 
ii) 8-117'+849-1145-11? 
(iv) 1-2 4+ 1-2? +0-23 (the bar denotes repeating digits) 
(vy) 14+1-74+1-77 
(vi) 10+9-11 49-11? 
(vii) 10 + 0-13 + 4-13? + 7-133 
(viii) 2-5-3 +. 4-577 41-57 
(ix) 2-37 + 2-33 + 2-34 + 2-3° 
(x) 2-37 + 141-3 
(xi) 1-273 + 1-277 40-27! 
(xii) 4.57! + 443-5 
4. To prove that a/b € @ has repeating digits in its p-adic expansion, first reduce to the 
case p{b. Then first suppose that a/b is between 0 and —1. Multiply the denomi- 
nator and numerator of a/b by some c which gets the denominator in the form 
cb = p’ — 1forsomer.Letd = —ac,sothat0 < d < p’ — 1.Thena/b = d/(1 — p’); 
now expand as a geometric series. You find that a/b has a “purely” repeating 
expansion. If a/b is not between 0 and —1, then it is obtained by adding or sub- 
tracting a positive integer from a purely repeating expansion, and the result will 
still be a repeating expansion once you’re past the first few digits. An alternate proof 
is to show that in long division you must eventually get repetition in the remainders. 


5. The cardinality of the continuum. You can construct a one-to-one correspondence 
f between Z, and the real numbers in the interval [0, 1] written to the base p by 
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setting f(dy + ap + +++ + a,p" +--+) = do/p + ay/p? + +>» +.4,/p"* > +---. f 
is not quite one-to-one, since a real number in (0, 1) with a terminating expansion 
has two preimages; for example, f(1) = f(—p) = 1/p.) 


72140-241-274+0-2341-244---. 
8. (i), (iii), (iv), (v), (ix). 


9.241-542-57 41-5 4---,343-542-57 43-53 4--5245-74+0-77 
+6-Pee-,541-74+6-74+0-PH es. 


10. 5, 13, 17. 


11. Let a,¢€Z; be any number which is a square mod p, let «,¢Z} be any number 
which is not a square mod p, and let a3 = pay, a4 = pa. 


12. Take, for example: 1, 3, 5, 7, 2, 6, 10, 14. 


. 


13. In Q, we have +1 and the two square roots of —1 found in Exercise 9. To prove 
the general fact, use Hensel’s lemma for each ay = 0, 1, ..., p — 1 with F(x) = 
xP — x, 


14. See Herstein’s Topics in Algebra, p. 160 (where it’s proved for polynomials with 
integer coefficients; but the proof is the same with p-adic integer coefficients). 


15. If there were such a pth root, then the polynomial (x? — 1)/(x — 1) would have a 
linear factor. But substituting y = x + 1 leads to an Eisenstein polynomial, which 
is irreducible by Exercise 14. To give the second proof, notice that (1 + p’x’)? = 
1 + p’*1x’ + (terms divisible by p?”**), and this cannot equal 1. 


16. Note that 1/1 — p)— (1+ p+ p? +--+: + p%) = p**/(1 — p). The other two 
series converge to 1/(1 + p),(p? — p + 11 — p?). 


17. (a) More generally, in place of p' one can take any sequence p,;€Z, such that 
ord,(p;) = i. Namely, show by induction on n that the map 


ie SUMS dp) Po + +++ + Aq—1Pn—1 


0,1,2,...,p"-—1 
with varying digits a; } at 4 } 


obtained by reducing modulo p", is one-to-one. 
(b) We have 
-(-1) > pisagt-:-+a,-(-p"'<(-1) D> ps 
i<n,iodd i<n,ieven 
since there are 1 + (p — 1) >;<, p' = p” integers in this interval, by part (a) each 
such integer has exactly one representation in the form ag + --- + a,—-,(—p)"'. 


18. Use Hensel’s lemma with F(x) = x" — a (or ax™"— 1 if n<0) and a) =1. 
1 + p has no pth root. Next, if a= 1+ a,p? +---, then to find a pth root let 
ay = 1 + a,pand apply Exercise 6 with M = 1, F(x) = x? — a. 


19. Use induction on M to prove the congruence: if «?“~' = a? ~* + pM—18 for some 
B (this is the induction assumption), then raising both sides to the pth power gives 
the desired congruence. Then show that the limit as M > oo has the properties (1) 
its pth power is itself, and (2) it’s congruent to « mod p. 


20. Use the same idea as in Exercise 19 of §2. 
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21. Look for X = Ag + pA, + p?A, +--+, where the A; are r x r matrices whose 


entries are p-adic digits 0, 1, 2,...,p — 1. Let X, = Ag +--- + p"A,. We want X,, 
to satisfy X? — AX + B = 0 mod p"*! (where we use the congruence notation for 
matrices to denote entry-by-entry congruence). Use induction on n. When n = 0 we 
obtain the congruence by choosing Ag = A mod p. The induction step is: 
(Xn-1 + p"A,)* “Ss A(X, -1 + p"A,) +B 

= (X?_, “= AX,-1 + B) + D(A WX n—1 + Xn-1An re AA,) 

= (Xp-1 — AX,-1 + B) + p"A,A (mod p"**), 
because X,,_, = A (mod p). Choose A, modulo p to be 

—(X?_, — AX,-, + B)p""A7}. 

Notice that this argument falls through for higher degree polynomials because of 
the noncommutativity of matrix multiplication. 


CHAPTER II §2 


1. 


Expand 1/(1 — q~*) in a geometric series; then multiply out, and use the fact that 
every positive integer n can be written (uniquely) as qj! - - - q?". 


. Define f(t) = 3t + the’ — 1) and show that f(t) is an even function, i.e. f(t) = f(—1). 
. For large k, ¢(2k) is near 1. Answer: 4./mk(k/ne)**. 


(i) modulo 5° we have (1 + 2-5)1(625+1-25) = (1 4. 2-5)'+9? = 11-(1 + 2-5)” 
=142-540-5742-53 42-54; 

(ii) modulo 35 we have (1 + 32)7¥? =(1 + 32)'*3*¥? =140-341-37 4 
1-33 +1-34; 

(ii) 14+5-743-P42-P42-Hh4--. 


. Note that p” = 1 modulo p — 1 for any N, so that, if you first approximate a given 


p-adic integer by the nonnegative integer obtained from the first N places in its p-adic 
expansion, you can then add a suitable multiple of p” to get a positive integer con- 
gruent to sy modulo p — 1 which is an equally good approximation. 


» Ll) = 0/4; LAs) = TT 1/1 — x(@)/9"). 


CHAPTER II §3 


2. An example is the complement of a point. 


5. It suffices to prove this for U = a + (p"), since any U is a disjoint union of such sets. 


Let a’ be the least nonnegative residue mod p" of aa; then since |«|, = 1, it follows 
8 Pp P 
n 


that «U = a’ + (p”), and so both U and aU have the same Haar measure p™". 


6. (1) the first digit in «; (2) (p — 1)/2; (3) £224 a(a/p — 1/2) = (p? — 3p + 2)/12. 


CHAPTER II §5 


1. For the first assertion, compare coefficients of ¢* in the identity: te*/(e’ — 1) = 


(> B,t'/k)(> (tx)4/j!). To prove the second assertion, take f. --- dx of both sides of 
the identity > B,(x)t*/k! = te™/(et — 1) and compare coefficients of t*. To get the 
third assertion, apply (1/t)(d/dx) to both sides of the same identity. 
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2. Clam: »(U) = 0 for any U if u has this property. Since U is a union of sets of the form 
a + (p%) for N arbitrarily large, we have for such N: |H(U)|, < max,|u(a + (p"))|,- 
Now let N > oo. 


3. B,; p*~1B,;(1 — p*~*)B,. 

5. (1 — 0 )B,; (1 — (1 — pI) Bs Sto ad — YL — p)Bis 1/G + 2D). 

7. Use the corollary at the end of §5 with g(x) = 1/(the first digit in x), so that f(x) = 
g(x) mod p, and g(x) is locally constant. 

9. lim > a/p™*} 


N>o O<sa<pN 
N=2M+1 ag=ag+azp2+---+a2mp?™ 


p-1 
= lim weer ("3 2% + p™ os a,p? +--- + pM = ou™) 


a2=0 a2m=0 


p-1 2 4 1 
=f q 2) see 
5) (1 + p?+p*t+--:) Ap +1) 


CHAPTER II §7 


2. (i) Use the Kummer congruence (1 — 5?~')(—B,/2) = (1 — 51977 !)(— By2/102) 
modulo 5? to obtain 1/3 = — By 92/102 (mod 53), and hence B,o, = 1 + 3-5 
+ 3-59 4-5, 
(ii) From the congruence (1 — 7?~1)(— ee) = (1 — 779° !)(— By 96/296) (mod 
7°), obtain Bygg =6+6-743-7 +--- 
(iii) Use (1 — 74-!)(—B,/4) = (1 — 7592 ve — Bso3/592) (mod 73) to obtain Bsgs 
=34+4-743-PH--. 


3. Recall that a rational number belongs to Z if and only if for each p it isin Z,. Then 


use parts (1) and (3) of Theorem 7. 


6. Let a= 1+ p? =5, and let g(x) = (ay + 2a,)~1, where ag, a, are the first two 
2-adic digits in x. Then follow the proof given for odd p. In the case p = 2, the 
Clausen-von Staudt theorem says that every nonzero Bernoulli number starting 
with B, has exactly one power of 2 in the denominator. 


CHAPTER III §1 


1. All roots of X?”~1 — 1 are also roots of X’’~! — 1 if and only if X” ~! — 1 divides 
X”-1! _ 1; this is true if and only if p” — 1 divides p’ — 1, which, in turn, is 
equivalent to f’ dividing f- 


2. Here is a table of all possible generators of F; : 


7 11 


possible a 
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. (1 + fyeny 044 Power is a generator. 


. Adjoinarootjof X? + X + 1 = Oand X? + X + 1 = 0,respectively. For example, 


in F, we multiply as follows: 
(a + bj + of*\d + ef + fj?) = (ad + bf + ce) + (ae + bd + bf + ce + cfij 
+ (af + be + cd + cf)j’. 


Finally, note that when q — 1 is prime, any element (not 1) of Fj is a generator. 


. Clearly F,(a) = F,. 


. If any two of the o; were the same, you would get a polynomial of degree less than q 


having q roots. 


. If P(X) factored over F,, say, P(X) = Py(X )P2(X) with deg P, = d < p, then the 


coefficient of X4~! in P,(X), which is minus the sum of d of the roots a + i, would 
bein F,. But then dae F,, and soae F,. But all elements of F, are roots of X? — X, 
and so cannot be roots of P(X). 


8. If p = 2, then —1 = 1 and it’s trivial; otherwise F, contains a square root of —1 
if and only if 1 has a primitive fourth root, ie., if and only if 4 divides q — 1. 

9. Assume the contrary, and use the same approach as in Exercise 19 of §1.2 and 
Exercise 20 of §1.5. 

10. First show, without using limits, that the formal derivative of a polynomial over any 
field obeys the product rule. This can be done quickly by using linearity of the 
derivative to reduce to the case of a product of two powers of X. 

CHAPTER III §2 


A good reference for the ideas in these exercises (especially Exercises 2, 6, 7, 8) 


is 
3. 
4. 


5. 


Chapter IV of Simmons’ textbook (see Bibliography). 


02°02 = pry, but lleallsup * lv2llsup =1, P01 lsup = IPlp = 1/p. 


Let F, = A/M be the residue field of K, where q = p’; it is an extension of degree f of 
the residue field F, of Q,. In the proof of the last proposition we saw that f <n = 
[K:Q,]. (In the next section we'll see that f divides n; e =n/f is called the ramifica- 
tion index.) First suppose that K is unramified, i.e., f =n. If we let x denote 
the image of x under the residue map A->F,, we see that we can choose a basis 
{v,,...,0,} of K over Q, such that {0,,...,0,} is a basis of F, over F,. One 
now checks that the sup-norm with respect to such a basis has the desired 
multiplicative property. Namely, first prove that x € K we have: ||x\lup < 1 if and 
only if x€ A; ||xllup <1 if and only if xe M. Then to show that ||xyllup = 
Ixlleup ‘ly llsup» Feduce to the case when |XIlup = ll¥llsup = 1, Le., x, ye A-— M. 
But then xy ¢ A — M. Conversely, if K is ramified, the sup-norm is never a field 
norm. Namely, one can show that in a field norm K has elements with norm a 
fractional power of p. 


Any element x € Z, can be written in the form x = p"u, where u is a unit. 
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CHAPTER III §4 


1. 


The values of | |, on Qare the same as on Q,, since any element can be approximated 
by an element of @,. To show that, for example, the unit ball in Q, is not sequentially 
compact, take any sequence of distinct roots of unity of order prime to p and show 
that it has no convergent subsequence. 


. Let ro = |b — al,. You get the empty set unless r is a sum r, + r, of two rational 


powers of p (or zero). Then consider cases depending on the relative size of ro, r1,1r2- 
For example, if ro = r; > rz, then you get the two disjoint circles of radius r, about 
aand b. The “hyperbola” has exactly the same possibilities; now r = r; — rz must 
be a difference of two rational powers of p. 


. Let C, = max(1, Co). Suppose B is a root with |B], > C,. Then B = —b,_, — 


b,-2/B — +++ — bo/B"~', and |B|, < max(|b,—;—1/B'|p) < max(|b;|,) = Co, a con- 
tradiction. 


. Set 5 = min|« — «,|,, where the minimum is over all roots «; # « of f. Use the last 


proposition in this section with the roles of 5 and ¢ reversed to find a root f of g such 
that |« — B|, <6. By Krasner’s lemma, K(a) < K(f). Since f is irreducible, 
[K(a): K] = deg f = deg g > [K(B): K], and hence K(a) = K(f). As a counter- 
example when f is no longer irreducible, take, say, K = Q,, f(X) = X7, « = 0, 
g(X) = X? — p?%*! for large N. 


. Let a be a primitive element, ie., K = Q,(a), and let f(X) ¢ Q,[X] be its monic 


irreducible polynomial. Choose ¢ as in Exercise 4, and find g(X) € Q[X] such that 
| f — g|, < &. (For example, take the coefficients of g to be partial sums of the p-adic 
expansions of the corresponding coefficients of f) Then g has a root f such that 
K = Q,(f) > Q(), and it’s easy to see that F = Q(f) is dense in K and has degree 
n= [K:Q,] over Q. 


. Set a = ./—1, B = ./ —a (with any fixed choice of square roots). We can apply 


Krasner’s lemma if either |B — «|, or |B — (—a)], is less than |« — (—«)|,, which 
equals 1 if p # 2 and 1/2 if p = 2. Since 

la = 1], = |-a = (-)DI, = |B a|,1B + |p» 
this holds if |a — 1|, < 1 for p #2 and <1/4 for p = 2. To do the next part, set 
a= Jp p= aft: Then it suffices to have either |B — «|, or |B — (—a)|, less than 
\2/pl,- Since |a — p|, = |B — a|,'|B + al,, this holds if ja — pl, < |4p|,. So 
choose ¢ = 1/pif p #2 and = 1/8 if p = 2. 


. First note that a satisfies the monic irreducible polynomial (X?"~!)/(¥?"~! ~— 1), 


(For the case n = 1, see Exercise 15 of §1.5.) Now let B = (—p)!/?" ie, Bisa 
fixed root of X’~'! + p = 0. Leta, =a — 1,a, =a? —1,...,a,-, =a?! — 1 be 
the conjugates ofa — 1. Check that |; — o,;|, = p'?~  foranyi # j. By Krasner’s 
lemma, it suffices to show that |f — a;|, < p” '“?~") for some i. If this were not the 
case, we would have p~? <[[?=;' |B — «|, = |((B + 1)? — 1)/B|,, since TI(X — a) 
= ((X + 1)? — 1)/X. Now use the relation B?~' + p =0 to obtain: ((6 + 1)? — 1)/B 
= B-¥?=7 (?)B'” 7. But the p-adicnorm of thisis bounded by| pf |, < p~ '. To prove the 
last assertion in the exercise, suppose that a is a primitive mth root of 1, with m not 
a power of p, and |a — 1|, < 1. Then we would have ja‘ — 1|, < 1 for any i. Let 
| # p bea prime factor of m, and let b = a™’, which is a primitive /th root of unity. 
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10. 


11. 
12. 


13. 


14. 


15. 


Then f = b — 1 satisfies: |B|, < 1, and at the same time 0 = ((B + 1)' — 1)/B = 
Die2 DB! + 1 But then ||, = [B(— dj=-2 OB'~?)|, < 1, a contradiction. 


Let 2 be an element of K with ord, = 1/e, where e is the ramification index of K. 
Then {z'};-0,1,--.m—1 are in distinct multiplicative cosets modulo (K”*)”, and any 
element of K* can be written uniquely as 2'*™u for some 0 <i < m,jeZ,ueK 
with |u|, = 1. Now show that u is an mth power. Namely, since its image in the 
residue field F, is an mth power, we can find uy such that « = u/uo” — 1 satisfies 
ja|, < 1. Finally, write the p-adic expansion 1/m = ag + a,p + a,p* +--+, and 
obtain 


u=uo(1 + a) = (uo(l + tot arp + ap? + ~ym 


Otherwise, K would have residue degree f > 1, since it would have more than p — 1 
roots of unity of order prime to p, and any two such roots have distinct residues 
in Fos. 


All have the cardinality of the continuum. 


Let y,,..., yy be elements in K such that |y,|, = 1 and the images of the y, in the 
residue field form a basis of the residue field over F,. Show that ywisi<f, 
0 <j <e — 1, forma basis of K over Q,, where ord, x = I/e. 


If B satisfies the Eisenstein polynomial X° + a,_,X°~1 +--+ + do, set a= —ap. 
Then «eZ,, ord,« = 1, and B° — a = f° + ay = —a,_ f°" — --- — a,B has 
p-adic norm less than 1/p. 


Follow the proof of Theorem 3 of Ch. I but working over the field K, with B playing 
the role of p (recall that ord, B = 1/e). Note that ord,(B° — «) > 1 + 1/e. Look for 
B + a,B? with a, {0, 1,..., p — 1} such that ord,((B + a) B)° — a) > 1 + 2/e, 
and so on. An alternate method is to note that |a/B° — 1|, < 1, then write the p-adic 
expansion for t/ee Z,, and compute B(a/B*)'/" € K; this will be an eth root of a. 
Finally, we have K = Q,(f) because £ has degree e. 


Let V < Z,[X] be the set of monic polynomials of degree n. For f, ge V define 
| f — gl, by the sup-norm. Note that V looks like Z> with the sup-norm, and it is 
compact. Let S < V be the subset consisting of irreducible polynomials. Any such 
polynomial gives at most n different degree n extensions of Q, in O p- For fixed fe S, 
the last two propositions in §3 show that there exists 6 > 0 such that any g € S with 
| f — gl, < 6 gives precisely the same set of degree n extensions as does f. By com- 
pactness, the set S has a finite covering of subsets in each of which the polynomials 
give the same extensions. So there are only finitely many extensions of degree n. 


16-17. See the article “Algebraic p-adic expansions” by David Lampert, to appear in 
the Journal of Number Theory. 
CHAPTER IV §1 
1. (i) D(p"/~”~); (ii) D(co) = alll of Q; (iii) D(p~); (iv) DC); (v) DG); (vi) DU); 


(vii) DU). 
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A counterexample for the last question: let f; = 1 + pX if jis a power of 2, and let 
fj = 1 otherwise. Then f(X) = [To (1 + pX”) = 320 p¥X4, where S; denotes 
the sum of the binary digits in j; this does not converge on D(1). 


Let d(x, Z,) = min{|x — yl, lye Z,}, ie. the “distance” from x to Z,. F irst prove 
that if d(x, Z,) > 1, then the series converges, in fact, it converges under a much 
weaker condition on |a,|,. Now suppose that d(x, Z,) = r < 1. Choose M so that 
p “+” <r < p-™. Then for n = p% with N > M, show that: p™ — p%~! of the 
factors in the denominator of the nth term are of norm 1, p”~' — p*~? of the factors 
are of norm 1/p, p’~? — p%~? of the factors are of norm 1/p, and so on, and finally 
p’-™ — p¥-M~-! of the factors are of norm 1/p™”, and the remaining p’~™~! 
factors are of norm r. Use this to give a lower bound for the ord, of the nth term, 
namely, : 


ord, a, + ord, n! — (p’~! — p%~?) — 2(p%~? — p¥~3) —-.- 
_ M(pN-™M ee, p’-M~}) ee (M + 1)p’~¥-1 
=ord,a, + ord,n! —n(p' +p ?7+---+p ™4) 
= ord, a, + (n/p™*! — 1)(p — 1). 


The more general case when n is not a power of p involves the same sort of estimate. 
In all cases, one finds that ord, of the nth term approaches infinity. 


. For p > 2, write the congruence in the form ((1 + 1)? — 2)/p = 0 (mod p), use the 


binomial expansion for (1 + 1)? to get S?=} (p — 1(p — 2)---(p —j + Dj! on 
the left, and consider each term modulo p. 


. Leta = log,(1 — 2) = —lim,..,, >", 2'/i. Now 2a = log,((1 — 2)?) = log, 1 = 0; 


hence, a=0. Thus, ord, £2_, 2'/i= ord, 22,4, 2'/i> min;,,4,(@—ord, i). For 
example, for n = 2” this minimum is n + 1. 


8-9. Non-theorem | is being used. 


10. 


11. 


(a) The real series for the square root converges to the positive square root (m + p)/m; 
the p-adic series converges to the square root which is congruent to 1 modulo p. Here 
they’re both the same. 

(b) Here the positive square root is (p + 1)/2n = (p + 1)(p — 1), but this is the 
negative of the square root which is =1 mod p. 


(d)-(e) Consider the following cases separately: 

Case (i). a — b is divisible by an odd prime p. Note that, since a and b are positive, 
relatively prime, and not both 1, it follows that a + b > 3 is divisible either by 4 or 
by an odd prime p, # p. Then (1 + «)!/? converges to a/b p-adically and converges 
to —a/b either 2-adically (if 4 divides a + b) or p,-adically (if p, divides a + b). 
Case (ii). a — b = +2’,r = 2. Note that in this case there must be an odd prime p 
dividing a + b. Then (1 + «)'/? converges to a/b 2-adically and to — a/b p-adically. 
Case (iii). a — b = +2, so that « = (a/b)? — 1 = 4(+b + 1)/b?. Note that here a 
and b are both odd. Then (1 + «)'/? converges to —a/b 2-adically and to a/b in the 
reals, provided that —1 < « < 1. The latter inequality holds unless b = 1, a = 8 or 
b = 3,a = 16/9. 

Case (iv). a — b = +1, so that » = (+2b + 1)/b?. Then (1 + «)!/? converges to 
—a/b p-adically for p a prime dividing + 2b + 1, and it converges to a/b in the reals 
unless b = 1,a@ = 30rb = 2,0 = 5/4. 
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12. 


13. 


14. 


15. 


The series (x d/dx)*1/(1 — x) = > n*x" represents a polynomial in Z[X] divided by 
(1 — X)**! in its disc of convergence, in both the real and p-adic situations. In 
particular, for x = p you get an integer divided by (p — 1)**!. 


The left side is —log,(1 + 7%) = —log,(32); the right side is 
—2 logs(1 — 3) = —log3((—3)*) = —log3(32). 


As an example where the exact regions of convergence differ, take f(X) = > X?". 
Then f(X) converges on D(1~) and f’(X) converges on D(1). 


(a) For example, > 2%, i!/p}, where p; denotes the ith prime. (b) I don’t know of an 
example, or of a proof that it’s impossible. 


17. For each rational number r = a/b € Q, make a choice of p" € Q, i.e., choose a root of 
x’ — p* = 0 and denote it p’. Now take, for example, f(x) = p&"?”. 

18. No. 

20. If you want each coefficient to j places, choose N so that p” > Mp’~*; write a/be Z, 
modulo p” in the form ay + a,p + --- + ay_,p"~!; and compute the coefficients of 
(1 + X)sot art +av-1P"~* modulo pi. 

21. (6) First prove that the convergence assumptions allow you to rearrange terms. 
Reduce everything to proving than an element in R[[X]] which vanishes for values 
of the variables in [—«’, e’] must be the zero power series. Prove this fact by induction 
on n. 

CHAPTER IV §2 

1. 6-742-77 +5-79; 2% 4 25 + 26 + 28 + 29 4 219 4 211, 

2. By removing roots of unity, show that the image of Z, is the same as the image of 
1 + pZ, for p > 2 and 1 + 4Z, for p = 2. 

3. p*|log, a<>p?|(p — Ilog, a = log,(a’~'), and the latter is congruent to 
a’~' — 1 mod p?, since in general log,(1 + x) = x mod p? for xe pZ,. 

4. log, x (no surprise!). 

5. Let c = f(p), and show that f(x) — c ord, x satisfies all three properties which char- 
acterize log, x. 

8. | fC. + p®) -f)|, =|-1- 11, =1. 


9. 7? -1=( + DY — 1), and for p > 2 exactly one of the two factors is divisible by 


10. 


12. 
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p, and hence by p’. If p = 2, then you have j = +1 mod 2%7!. 


Approximate 1/2 by (p% + 1)/2, and compare ([] j<¢px+19/2, prj J)” With []j<py, pr jhe 
which we proved is = —1 (mod p¥). 


In the first equality both sides are 1 + 3-5 +2-574+ 3-53 +4---, and in the 
second equality both sides are 1 +6-7+5-77+4-7P +--+. 


13. 


14. 
15. 
16. 
17. 
18. 
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On the right side of property (3), we have so = p —r, s, = (p—1-—n/(1 — p). 
Note that the expression is congruent to m!~® mod p; then it remains to verify that 
the (p — 1)th power is 1. Use: (p — 1)s, =1—p+r=1-— 5p. 


In all cases show that the image of the function is in the open unit disc about 1. 
1+ X 4+ X74 3X3 4 3x4; 4X 44X74 4X3 4 3x4 

The coefficient of X” is ((p — 1)! + 1)/p!; Wilson’s theorem. 

E,(X?/E(X)? = e °* 1 + pXZ,[[X]]. 

Sf (Xf (XP = exp(> 2 o(b;-. — pb) X”). If c; = b;_-, — pb;€ pZ, for all i, then 


! def 
since e* € 1 + pXZ,|[X]] whenever ce pZ,, it follows that 


Tle" 61+ pXZ,{[X]]. 


Conversely, suppose c;, is the first c; not in pZ,; then the coefficient of X?*° in 
TI e*” is congruent mod p to c,, # 9 mod p, and by Dwork’s lemma f(X) ¢ 
1+ XZ,[L[X]]. 


CHAPTER IV §4 


1. 


2. 


3. 


4. 


(i) Join (0, 0) to (1, 0) and (1, 0) to (2, 1). 
(ii) Join (0, 0) to (3, —2). 
(iii) Join (0, 0) to (2, 0), (2, 0) to (4, 1), and (4, 1) to (6, 3). 
(iv) Join (0, 0) to (p — 2, 0) and (p — 2, 0) to (p — 1, 1). 
(v) Join (0, 0) to (1, 0), (1, 0) to (2, 1), and (2, 1) to (3, 4). 
(vi) Join (0, 0) to (p? — p, 0), (p? — p, 0) to (p? — 1, p — 1), and (p? — 1, p — 1) 
to (p?, p + 1). 


(a) Any root « of f(X) that satisfied a polynomial of lower degree d would have 
ord, « equal to a fraction with denominator at most d; but by Lemma 4, ord, a = 
—m/n. 

(b) If f(X) is an Eisenstein polynomial, then a, 1X"f(1/X) = 14+ a,_,/a,X +--+: 
+ Ao/a,X" has for its Newton polygon the line joining (0, 0) to (n, 1). 

(c) Counterexample: 1 + pX + ap?X?, where ae Z} is chosen so that | — 4a is 
not a square in Z,, 


All slopes are between 0 and 1, and for each segment of slope A there’s a segment (of 
the same horizontal length) with slope 1 — A. The number of possible Newton 
polygons of this type is: 2 forn = 1; 3 forn = 2;5 forn = 3;8 forn = 4. 


(i) Join (pi — 1, —j) to (pit! — 1, —G + 1) forj = 0, 1,2,.... 
(ii) The horizontal line from (0, 0). 
(iii) Join (pi — 1,1 + p+--- +p! —j)to 
(Pitt —1ltpt---t+ pi t+ pi —(G+ 1) forj =0,1,2,... 
(iv) One infinite straight line from (0, 0) with slope —1/(p — 1). 
(v) The segment joining (0, 0) to (2, 1) and the infinite line from (2,1) with slope 1. 
(vi) The infinite line from (0, 0) with slope 1. 
(vii) Join (7,1 +2+---+(j-1)) to (G4+1,14+24+---+(G—1)4+,) for 
J=0,1,2,.... 
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5. 


10. 


11. 


12. 


(viii) Starting with a segment from (0, 0) to (2, 2), there are infinitely many segments 
with slope increasing toward =f ; the details of these segments depend upon 
rational approximations to pa: 


The Newton polygon of 1 + >, p!*"¥71X' is the infinite line from (0, 0) with 
slope ee 


For example, > 2  p'X”'~ | converges on D(1), its Newton polygon is the horizontal 
line from (0, 0), and it has no zero in D(1). 


Reduce to the case A = 0 by replacing f(X) by f(p~*X), where p* is a choice of 
fractional power of p in Q. Then multiply by a scalar to reduce to the case when 


min ord,a; = 0. 


For x € D(1) clearly | f(x)|, < 1. To obtain an x for which | f(x)|, = 1, it suffices 
to reduce modulo the maximal ideal of Q, i.e., to consider the series f(X) € F,[X]. 
(This has only finitely many terms, because ord, a; > 00.) Then choose any x such 
that f(x) # Oin F,. 


. Apply the Weierstrass Preparation Theorem to the series f,(X) = f(X)/a,X"€ 1 


+ XOQ[LX]] which is obtained by dividing f(X) by its leading term a, X". Take A = 0. 


Reduce to the case f(X)e1 + XQ([X]] by factoring out the leading term, as in 
Exercise 9. Use the Weierstrass Preparation Theorem to write h(X) = f(X)g(X). 
But f(x) = 0 implies that h(x) = 0, and h(X) is a polynomial. 


Use Exercises 9 and 10, and show that if E, has one zero, then it must have infinitely 
many. To do this, let x be any pth root of a zero of E,, and use the relation E,(X)? = 
E,(X?)e?*. 


Write f(X)g(X) = A(X). If f(X) has a coefficient a; with ord, a; < 0, then, by 
Lemma 4, f(X) has a root « in D(1~). But then h(a) = 0; however, f and h have no 
common roots. If h(X) has a coefficient a; with ord, a; < 0, then it has a root « in 
D(17). Since g(a) # 0 it follows that f(«) = 0, and we again have a contradiction. 
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1. 
2. 
3. 


4. 


5. 
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Il — T); 1/(1 — qT). 
1/1 — TX1 — qT)--- (1 — @"T). 


There is a one-to-one correspondence between the points of (n — 1)-dimensional 
affine space and the points of H, given by 


, eseoetg XD OX, see Xn-ts —g(m1, Doar) Xn-1))- 


Suppose, for example, that r = 2. Then show that + Hy,, | 7.y(Fqs) =H Hy, (F gs) 
+H£H,,(F gs) —H¢ Hy, 5,(F qs), where H,,;, is the hypersurface defined by the 
product of the two polynomials. 


Write an n-dimensional projective hypersurface as a disjoint union of affine hyper- 
surfaces (one in each dimension n, n — 1,n — 2,...). 


6. 


10. 


11. 


12. 


13. 


14, 


Answers and hints for the exercises 


Let Q(T) = T] (1 — oT), P(T) =T] (1 — B;T); these are both in Q[T]; then 
exp(> N,T*/s) = P(T)/Q(T). The converse is shown by reversing this procedure. 


. Comparing coefficients of T gives a= N, —1-—q, where N, = + E(F,); by 


Exercise 6, N, = 1 + q° — o§ — aS, where «,, «, are given by (1 + aT + qT’) = 
(1 — a,T)\1 — a, T). 


. (i) For q = 2 (mod 3), every element of F, has a unique cube root; then show that 


N, = 4 + 1,sothata = 0,and the zeta-function is (1 + gT?)/(1 — T)(1 — qT). 

(ii) For g = 3 (mod 4), —1 does not have a square root. Then for exactly one from 
each pairx, = a,x, = —awehaveasolution tox, = +, /x3 — x,;this gives a 
one-to-one correspondence between ae F, and points (x,, x2) on the affine 
curve. Counting the point at infinity, we obtain N = q + 1, as in part (i), and the 
zeta-function is (1 + qT7)/(1 — T)(1 — qT). Next, for q = 9 = 3?, we have N, 
= (the N, when q = 3) = 16,a = 6, Z(T) = (1 + 3T)°*/(1 — T)1 — 9T). For 
q = 5 we have Z(T) = (1 + 27 + 5T*)((1 — T)(1 — 5T), and for q = 13 we 
have Z(T) = (1 — 6T + 13T?)(1 — T)(1 — 13T). 


. Suppose we have two rational functions f(T)/g(T) and u(T)/v(T) with numerator 


of degree m and denominator of degree n, where the first is exp(> 22, N,T*/s) and 
the second is exp(> 2, N; T°/s), and suppose that N, = Njfors = 1,2,...,m+n. 
It suffices to show that then f(T)/g(T) = u(T)/v(T), because that implies that 
N, = N; for all s. But f(T)o(T) = g(T)u(T) exp(2 2 (Ns — N5)T*/s) = g(T)u(T) 
X eXP(> 2 minti(Ns — Nj)T*/s), and the equality of polynomials comes from 
comparing coefficients of powers of T up to T”*". 


Show that there are (q — 1)q? four-tuples with nonzero x3 and (q — 1)q with zero 
x3; then the zeta-function is (1 — qT)/(1 — q°T). 


The zeta-function of the affine curve is listed first, followed by the zeta-function of its 
projective completion: 
(i) (Q — TYG — qT)’; 1/1 — TA — qTY. 
(ii) (1 — T° — qT)’; 1/1 — qT)’. 
(iti) (1 — T)/(1 — qT) (unless p = 2, in which case it is 1/(1 — qT)); 


1/ — T) —- qT). 


(iv) 1/4 — qT); 1/4 — T)1 — qT). 
(v) (1 — T)/((l — gT); 1/( — qT) (unless p = 2, in which case 1/(1 — qT); 
1/1 — T)( — 4T)). 


1 — TX — qT) — TP — qT — q*T). 


It suffices to show that for any prime p, the coefficients (which are a priori in Q) are 
in Z,. 


Write the numerator in the form 1 + a,7 + a,T? + a,T? + p*T*. To show 
that a, = a, = a; = 0,i.e., that the zeta-function agrees with the zeta-function of 
the projective line through the T-term, it suffices to show that N, = p° + 1 for 
s = 1, 2, 3. But since p* $ 1 (mod 5) every element of F,. has a (unique) Sth root. 
(This is the same procedure as in Exercise 8(i).) 
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Answers and hints for the exercises 


CHAPTER V §4 
1. See the proof of the second proposition in §III.3. 


2. Since @(T) = > a;T/ with ord,a;> jp — 1), we have O(aXy!---X}") = 
2 v=jwSoX” with ord, g, = |v|/(|w|(p — 1)). Set M = 1/(|w|(p — 1)); then ord, g, 
> MII. 


3. Use induction on the number of nonzero a;’s; in case of difficulty, see Lang’s Algebra, 
p. 209. 


4. Make a change of variables x +> x + x9, where xo € F, has nonzero trace. 


5. For u # 0 make the change of variables x t+ ux in Exercise 4. 
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